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FOREWditl) 


The present Monograph is based on an earlier Me^^S^dnm produced by 
the Directorate of Ordnance Factories (Explosives) for the use, primarily, of 
those concerned with pilot plant and plant scale experiments on chemical manu- 
facturing processes in the Royal Ordnance Factories (Explosives). Much work 
of this type was being carried out and it had become evident that it was desirable 
for the results of such experiments to be subjected to critical tests of significance. 
A convenient account of the straightforward tests of significance, written from 
the point of view of the individual who has to apply them in practice without 
necessarily a full knowledge of their theoretical background, was not readily 
available, and an attempt was therefore made to prepare one. 

It was evident that to apply tests of significance conveniently and economically 
the experiments had to be planned in appropriate forms. It is considered that 
the methods outlined should be as much a standard tool of the industrial experi- 
menter as a chemical balance is of the laboratory experimenter. In carrying out 
an industrial experiment the choice is not between using a statistical design with 
the application of the appropriate tests of significance or the ordinary methods : 
the choice is between correct or incorrect methods. Even the simplest experiment 
requires an estimate of the significance of its results. 

The practice sometimes followed of consulting the statistician only after the 
experiment is completed and asking him “what he can make of the results^’ cannot 
be too strongly condemned. It is essential to have the experiment in a form 
suitable for analysis and in general this can only be attained by designing the 
experiment in consultation with the statistician, or with due regard to the 
statistical principles involved. 

The present Monograph, therefore, is intended to be a guide to both the 
planning and the interpretation of experiments on the industrial scale, and it is 
hoped that the methods described will become part of the everyday technique 
to those who carry out such experiments. 

R. C. BOWDEN, 

Director of Ordnance Factories (Explosives) 
Ministry of Supply. 
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FROM THE PREFACE TO THE FIRST EDITION 

The present monograph is intended to provide for those who carry out 
investigational work a guide to modern statistical methods, both the use of tests 
of significance to attain reliability in deductions from experimental data and the 
use of statistical design to attain the maximum precision with the minimum 
expenditure* 

The subject is treated entirely from the practical point of view.(^) Theory 
is at a minimum and the only mathematics involved is simple arithmetic. Each 
statistical method discussed is illustrated with practical examples worked out in 
detail to show what exactly is involved in its use. Some of the more advanced 
techniques may appear rather formidable, but it is strongly recommended that 
they should be approached through the use of the simpler methods. A little 
practice in these will lead to a familiarity and confidence with which it will be 
realized that the advanced techniques are merely comparatively easy extensions 
of the simpler. 

The theoretical background on which the author has drawn is of course that 
largely developed by Professor R. A. Fisher, expounded in his '‘Statistical Methods 
for Research Workers’* and “The Design of Experiments.” 

Indebtedness is expressed to Prof. R. A. Fisher and Dr. Frank A. Yates for 
permission to reprint Tables III-VI from their book “Statistical Tables for 
Biological, Agricultural, and Medical Research” (Oliver & Boyd, Edinburgh and 
London), and to the British Standards Institution for permission to reprint certain 
factors for Quality Control Charts from B.S. 600R, “Quality Control Charts.” 

The examples used here are all the results of experimental and investigational 
work in the Royal Ordnance Factories (Explosives), and the author’s thanks are 
due to Mr. A. C. H. Cairns, Mr. A. H. Woodhead, Mr. D. Newman and others 
whose collaboration has been invaluable in the development of the ideas and 
outlook presented here. The author is also indebted to Dr. R. C. Bowden, 
Director of Ordnance Factories (Explosives), for his active interest and encourage* 
ment throughout this work. 

K. A. B. 


PREFACE TO THE SECOND EDITION 

The principal additions to this Edition are a substantial enlargement of 
Chapter I and two new chapters, Chapter XIII on balanced incomplete blocks 
amd Chapter XIV on confounding. Further additions are the components of 
variance for unequal column size in Chapter VII (d), the exact formula for the 
residual variance about a regression line in Chapter IX (h), the Doolittle method 
of computation in multiple regression in Chapter X (d), and the partitioning of 
sums of squares in Chapter XII (c). 

Minor additions have also been made at many points. 


K. A. B. 


(^)The reader will find the mathematical theory of the simple statistical tests well dealt 
with by A. Linder : “Statistiche Methoden fur Naturwissenschilfter, Mediziner, und 
Ingenieure*" (Verlag Birkhauser, Basel). Full and comprehensive treatments are by M. G. 
Kendall : “The Advanced Theory of Statistics” (Griffin) and S. S. Wilkes : “Mathematical 
Statistics” (Princeton). 
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CHAPTER I 


INTRODUCTION 

(a) Experimental Error 

In all experimental work our observations are always subject to experimental 
error. No experimenter would seriously dispute this statement. This makes it 
all the more remarkable that only recently has it begim to be generally admitted 
that in order to judge the trustworthiness of any experimental result it must be 
compared with an estimate of its error, an operation known to the statistician as 
a * *test of signi ficance.” It can be said at once that the lack of this test of signi- 
ficance is the main origin of the unsatisfactoty nature of most industrial experi- 
mentation : the errors are large, larger than is realised, and the apparent effects,., 
attributed by the experimenter to such factors as he has varied, are in reality 
arising solely through the accidental fluctuations due to these errors. If rigorous, 
tests of significance were applied, many of the assertions made would be shown, 
to be based on quite inadequate evidence and many would also be found to be 
fallacious (the fact that an assertion has been made on inadequate evidence of 
course does not imply that that assertion is incorrect it may turn out that the 
asserter was lucky in his gamble. However, the plant manager in taking the 
advice of his experimenter does want to know whether he is acting on a scientifically 
proved fact or on a subjective guess). 

(b) Classical and Industrial Experimentation 

In this question of the importance of error we have one of the principal 
differences between laboratory and industrial, or plant scale, experimentation* 
The laboratory experimenter is generally in the fortunate position of being able 
to have all his independent variables under complete control : his materi^ can 
be of the highest possible purity : his measuring apparatus can all be of the highest 
accuracy, and usually his errors are small. 

In referring above to the laboratory experimenter, we meant particularly 
those working on physical-chemical systems. Any work involving living matter, 
bacteria, fungi, etc., is always found to have relatively large variability outside 
the control of the experimenter, and as far as the magnitude of his experimental 
error is concerned he is more in the position of the industrial experimenter even 
if he is working on the laboratory scale. 

The industrial experimenter, on the other hand, often cannot obtain complete 
control of all his variables, for frequently the scale of the experiment is so vast 
that an army of supervisors would be required to keep them under control. The 
attainment of good control may also be a difficult technical matter : thus it is 
generally easy in a well-equipped laboratory to control temperatures within very 
fine limits with the usual thermostatic equipment. For a plant scale experiment 
this is generally quite impracticable. Again, it is easy enough to provide excellent 
lagging for any laboratory apparatus : to lag an absorption column 50 feet high, 
on the other hand, may be absurdly expensive. 

Further, for the industrial experimenter it may be in practice essential to 
carry out the investigation with only the slightest hindrance to normal production. 
Thus it might be possible to obtain control of the independent variables, but only 
through tafing such time and care on each batch that the throughput through the 
plant would be seriously reduced. Or we might wish to vary some of the inde- 
pendent variables over a wide range, in order to get points spaced well apart, so 
that their effect on the dependent variable would show up clearly : this might 
lead to the production of a large amount of out-of-specification scrap material. 
It would thus be necessary to work in a narrow range of the independent variable, 
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which means that the magnitudes of the effects that we have to observe will be 
much reduced and may well be swamped by errors. 

These factors result in industrial experiments generally havii^ a much larger 
error than laboratory work. Under these conditions it is difficult to decide 
whether a particular result is genuine or due to the error, and this calls for a 
statistical test of significance. Further, in order to get a significant result (i.e. a 
result in which the apparent effect is significantly greater than the error in its 
determination) we need a larger number of experiments (and this is precisely 
what is so undesirable in industrial work on account of the expense) unless we 
use a number of special statistical devices. 

Finally, it might not be out of place to remark that it has been the author's 
experience that even in laboratory work the errors, when calculated objectively 
with exact statistical technique, are often much larger than the experimenter 
believes or claims them to be. 

It is clear from the above considerations that the industrial experimenter has 
to face a set of problems additional to those of the laboratory experimenter, and 
hence requires an additional set of techniques. These have been developed in 
recent years by research workers in agriculture (notably R. A. Fisher and his 
colleagues at the Rothamsted Experiment Station) who have to contend with the 
weather and with variations in the fertility of the soil amongst the variables outside 
their control. 

(c) Replication 

If the error of our estimate of an effect is too great, generally there are two 
alternative methods of reducing it. One is to refine the technique, e.g. by using 
more precise thermostatic equipment, weighing on better balances, taking more 
meticulous precaution against evaporation, etc. Alternatively, and often on the 
industrial scale this is the only alternative, we can replicate (repeat) the experiment 
a number of times and take the average of the results. Unfortunately this process 
is not very efficient, for the error of the average is inversely proportional to the 
square root of the number of observations. Thus averaging four observations 
brings the error of the average down to one-half that of a single observation : 
the average of sixteen observations has one-quarter of the error of a single 
observation. 

On the laboratory scale, experiments are relatively cheap, and we can afford 
to take many more observations than are strictly necessary to establish a given 
conclusion with a given level of significance. We can easily take so many that 
the error of the average, if it ever was very high, has become so small that an 
exact test of significance is obviously unnecessary. On the industrial scale, 
however, the work is so expensive, involving a large consumption of supervisory 
man-power, raw materials, power, etc., that to take more observations than would 
suffice for our purpose, defined above as the establishing of a given conclusion 
with a given level of significance, woxild be scandalously extravagant. (^) 

(d) Experimental Design : Randomised Blocks 

Statistical theory has so far only been mentioned in connection with the 
theory of errors, to which of course it is fundamental. It has, however, a great 
deal to say on how experiments should be designed so as to allow the obtaining 
of an unbiassed estimate of error and to allow the error to be at a minimum. 

Let us start with one of the simplest conceptions in experimental design, the 
so-called ‘‘Randomised .Block." 

(^) This aspect of statistics has been developed considerably recently ; c.f. '^Sequential 
.Analysis of Statistical Data : Applications" : Statistical Research Group. Columbia 
University : Columbia UniversiW Press, and G, A. Barnard : "Sequential Tests m Industrial 
Statistics" (Supplement to the Journal of the Royal Statistical Society, Vol. VIII, No. 1, 
p. 1, 1946). 
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Suppose we wish to compare four “treatments.” We are using “treatme^t^* 
in a generalised sense : the foiu: treatments may e.g. be literally four fertiliser 
treatments for wheat, or four diets for rats, or four temperatures for a nitratioii 
process, or four concentration of plasticiser in a plastic. Further suppose that 
the material to which the treatments are applied arrives in batches la^ enoiigh 
to allow only four experiments per batch. We will suppose that it is probable 
that successive batches differ appreciably in their quality. We will these 
batches by the general term “blodb.” In the wheat example quoted above, four 
adjacent plots of ground may be considered as the “block” : they are more likdy 
to resemble each other in their fertility than other plots belonging to other blocks 
which will be further away, possibly in a different field. In the case of the rgts, 
it is probable that rats from the same litter resemble each offier more than rats 
from other litters. In the case of the nitration process, a tank full of well-stin^ 
acid and a batch of material to be nitrated mixed thoroughly tiU k is homogeneous 
would be the equivalent of the “block.” Further suppose that we are going to 
replicate four times (that is, carry out each observation four timesf*)). If the 
Treatments are A, B, C, and D and the Blocks are 1, 2, 3 and 4 we could ^ecute 
the experiment as in Table 1.1. 

Table 1.1 


Block 

Treatment 

1 

A 

A 

A 

A 

2 

B 

B 

B 

B 

3 

C 

C 

C 

C 

4 * 

D 

D 

D 

D 


A rnoment's thought will show Aat a scheme suph as iu J4 is 

okmously abated, for whatever show ^e as likely to be due to 

dffl^eencbs between blocks as to j^fi^ences be^e^ freatoqepts. 

A siighriy better procedure would be to di^rtoi^Ee ilite ^eatments completely 
at random. We can labd 16 cards A, % C, © e^ jhKa^^em in a 
and withdraw them one by one, writer down m r^jf m 

:succe8sive blodcs. The result of one such i^pet^kion m in IMtle 1*2. 


Table 1.2 


Block 

Treatmmts 

1 

D 

B 

B 

B 

2 

C 

D 

A 

B 

3 

B 

A 

C 

A 

4 

C 

c 

D 

A 


M we take the average of the four results for each treatment they wiQ be 
statistically soimd if the randomi^on wa© properly jKtrried out, but the ^ror m 
each mean (average) will be infiated because it includes in itself the diSereaces 
between bkx^ The ei^eriment will thus not be as accurate as it n%iit be- 
lt would obviously & better to allocate each treatment once to 
asm Table 1.3. , 

It will be convenient if we decide that a “single r^heation*’ measip: ^ eaip^M 
x:amed ont once, “double t^lication” means an Mq>einaiijeut out fouo- 

11 










Table 1.3 


Block 

Treatment 

1 

A 

B 

C 

D 

2 

A 

B 

C 

D 

3 

A 

B 

C 

D 

4 

A 

B 

C 

D 


When we take the average of the four A% and compare it with the average 
of, say, the four B’s, the difference between the average is completely independent 
of any block differences. 

It might, however, be more satisfactory if the order of carrying out the experi- 
ments were randomised (as they stand in Tablejf^ it is systematic, the order being 
A, B, C, D in each block). If there was any tendency for the first experiment in 
each case to be high, then this would appear as a high result for A which would 
be fallacious. Randomisation, such as has been carried out in Table 1.4, would 
avoid this. 

Table 1.4 


Block 

Treatment 

1 

D 

B 

A 

C 

2 

B 

C 

A 

D 

3 

C 

B 

A 

D 

4 

A 

c 

D 

B 


We have now arrived at the concept of the “randomised block.” All treat- 
ments to be compared occur an equal number of times in each block, the block 
being large enough to take all treatments at least once. The replication is achieved 
by repeating the experiment on further blocks, the order of experiments always 
being randomised. Of course, if the size of the block is such that several replica- 
tions can be placed in a single block, then this can be done and nothing is gained 
by replicating in different blocks. (®) 

The analysis of the results of such an experiment, with the testing of the 
significance of the results, is discussed in Chapter VII (e). 

(e) The Latin Square 

The hypothetical experiment discussed in the previous section, though now 
rigorous in design in that it cannot lead to false conclusions, is in some cases 
capable of further refinement. 

To particularise slightly, suppose that in a multi-stage process batches of a 
ton are split into four sub-batches which are processed simultaneously (approxi- 
mately) in four reactors, and we wish to compare the effects of four different 
treatments at this latter stage. With the four treatments on the four sub-batches 
in each batch we have the randomised block technique, the allocation of the four 
treatments to the four reactors being decided by randomisation. It is possible, 
however, to arrange that with such treatment each of the four reactors occurs 
once and once only, so that the treatment averages are completely independent 
of any possible differences between reactors. Such an arrangement is that ffi 
Table 1.5, where the letters refer to the treatments, 

(®) See, however, the end of Chapter XII (e). 
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Table 1.5 


Batch 

Reactor 1 

Reactor 2 

Reactor 3 

Reactor 4 

1 

A 

B 

c 

D 

2 

C 

D 

A 

B 

3 

B 

A 

D * 

C 

4 

D 

C 

B 

A 


Examining Table 1.5, it will be seen that the four sub-batches of each batch 
include all four treatments. At the same time, with eacli of the four treatments 
all four reactors occur once and only once and all four batches occur once and 
only once. Possible errors due to differences between Batches and between 
Reactors are thus eliminated both from the averages, thus allowing unbiassed 
estimates to be obtained, and also from the error, thus making the latter a minimum. 

The computation of the results of a Latin Square is discussed in Chapter 
Xll.(g). 

(f) Balanced Incomplete Blocks 

In the ordinary Randomised Block the block must be large enough to take 
one complete set of treatments. It may happen, however, that we wish to compare 
more treatments than can be got into one block. 

Suppose we have raw material in batches that are only large enough to allow 
4 experiments to be carried out on one batch, and further suppose that there is an 
appreciable variance between batches. If we wish to carry out a comparison of 
2, 3, or 4 treatments this between batch variance would not worry us, as each set of 
2, 3 or 4 treatments could be carried out on the same batch. If, however, we 
wished to compare 5 or more treatments, then one or more of these* would have 
to be carried out on a different batch from that on which the first 4 treatments 
were tested, and the comparison would be invalidated by the confusing element 
of the between batch variance. The standard method of avoidiag this difficulty 
is to select oije of the treatments as a ‘‘standard,’’ and include it in each batch. 
All treatments are then compared directly with the standard, and of the other 
comparison some are direct through the treatments occurring in the same block 
and others are indirect, from treatment M to the standard in its block to the 
standard in the other block concerned to the treatment N. 

Although this procedure does work it is by no means very satisfactory, as 
there are three different errors for the three different types of comparison. Further, 
it is not the most efficient possible. 

Consider the alternative design in Table 1.6 where we have the five treatments 
replicated four times in five batches of four. 


Table 1.6 


Batch Number 

1 

2 

3 

4 

5 

A 

A 

A 

A 

B 

B 

B 

B 

C 

c 

C 

c 

D 

D 

D 

D 

E 

E 

E 

E 
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There are a number of points about this design : — 

(a) Each treatment occurs once and only once in 4 of the 5 batches. 

(b) Any specified pair of treatments, e,g. A and B, or C and E, occur together 
in 3 of the 5 batches. Thus direct comparisons between B and D are 
possible in Batches 1, 3 and 5. 

(c) Direct comparisons between a specified pair of treatments is not possible 
in the other 2 batches j thus there is no 1) in Batch 2 and no B in batch 4. 
Nevertheless a very satisfactory comparison can be made, for A, C and E 
occur in both Batches 2 and 4 and we can in effect use the average of 
these three as a “standard.” It will be noted that to use the average of 
three as a standard is more satisfactory than using a single individual. 

These experimental designs are known as balanced incomplete blocks. They 
are balanced because each treatment occurs to exactly the same extent : they are 
incomplete because no block contains the full number of treatments. They suffer 
from the restriction that balanced arrangements are not possible for all experi- 
mental set-ups. Broadly speaking, if we fix the number of treatments that we 
wish to compare, and if the number of experiments per batch (or “block”) is also 
fixed, then the number of replication of each treatment is thereby determined. 
This is the principal disadvantage of these designs : the number of replications 
they require may be greater than we think are necessary to attain sufficient accuracy. 

Some of the designs are more satisfactory from this point of view than others. 
In general, for a given number of treatments, the larger the block we can use the 
smaller the number of replications that are required to establish balance. Thus 
if we have 21 treatments to be compared, then working with blocks of 3 we require 
10 replications. Working with blocks of 4 we could use the scheme for 25 treat- 
ments (duplicating 4 of the most important if we wished to bring the total up to 25) 
which would require 8 replications. Blocks of 5 are a very satisfactory size as 
only 5 replications are required. It is a curious property that blocks of 6 and 7 
are less desirable, requiring 8 and 10 replications respectively. The exact require- 
ments in any particular instance are a consequence of the combinatorial properties 
of numbers. 

The computation for these designs is discussed in Chapter XIII. 

(g) Lattice Squares 

The outstanding disadvantages of the Balanced Incomplete Blocks is that to 
obtain balance the number of replications necessary becomes rather high as the 
number of treatments increases. Generally, p^ treatments require (p + 1) repli- 
cations, e.g, 36 treatments require 7 replications, and this we may consider to 
be more than is justified. 

The Lattice Square is probably the most suitable device for this situation. 
Suppose we have 25 treatments which we wish to compare. We can write them 
down in the form of a square as in Table 1.7. 

Table 1.7 


11 

12 

13 

14 

15 

21 

22 

23 

24 

25 

31 

32 

33 

34 

35 

41 

42 

43 

44 

45 

SI 

52 

S3 

54 

55 


These 25 treatments can be tested in blocks of 5 by taking first the rows as 
blocks and then the columns as blocks. Repeating each set gives a total of 4 
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replications for each treatment. It will be noted that these designs are not 
balanced. For example, treatment 11 occurs in the same blocks as treatments 
12, 13, 14 and 15 and also 21, 31, 41 and 51, but never with the remaining 16 
treatments. Satisfactory comparison are nevertheless possible through inter- 
mediates, and with the appropriate algebraic treatment the designs are very 
efficient. (*) 

The chief virtue of these lattice designs is that the number of replications 
required is independent of the number of treatments. Thus, for example, to 
test 49 treatments we require only 4 replications, whereas using the Balanced 
Incomplete T' ck technique we require 8 replications. 

(h) The Nature of “Blocks” 

We have been discussing in the previous sections devices for circumventing 
the difficulties arising through the restricting size of the “blocks** available. The 
concept of block is an extremely general one, but the following are some examples : 

(a) In the science of agriculture, of course, the block is made up of plots 
of ground immediately adjoining each other, it having been found that 
plots close together tend to have fertilities more comparable than plots 
widely separated. 

(b) For experiments on the nutrition of animals, it has been found that 
animals from the same litter resemble each other in their response to a 
given diet more than animals from other litters. 

(c) In experiments on the purification of the products of fermentations, each 
fermentation may be conceivably different from the others in some slight 
degree, so may be considered as a block. 

(d) If, in any experiment in which the personal idiosyncracies of th,e operator 
may effect the result, the total number of treatments to be compar^ 
involves the use of more than one operator, then clearly the operator is 
the block. 

(e) There are many other possible sources of inhomogeneity in the experi- 
mental background that may be considered as blocks. For ex^ple, in 
a comparison of treatments involving use of a thermostat or incubator 
it is obviously preferable if possible to place all the units with all the 
treatments in the same thermostat or incubator at the same time. Errors 
due to lack of perfection in the equipment will then be at an absolute 
minimum. It may be, however, that the thermostat is not large enough 
to take all the treatments at one time. We then either have to use several 
thermostats, in which case the thermostat is obviously the block, or to 
use the same thermostat at successive intervals of time, in which case 
the time is obviously the block. 

(g) In any experiment in which a number of treatments are to be compared 
and in which the material on which they are applied may change slightly 
with time, and in which only a given number of treatments can be tried 
out in one day, then the day is the block. Examples are where there 
might be differential leakage from a gas holder in the case of a mixture 
of gases, or where a broth, e.g. of penicillin, may be appreciably imstable. 

(h) In many cases there are no clear-cut distinctions of the type illustrated 
above, but nevertheless advantages will result from splitting the experi- 

i*) This design will not be discussed in detail as the author has not found instances 
in industrial work where the comparison of such large numbers of treatments is necessary. 
It is described here, however, as it is the simplest of a series of such designs. A general 
description with refer^ces is by F. Yates i Empire Journal of Experimental Agriculture 
VIII, page 223, 1940. Descriptions and rnethods of computation of several are in C. H. 
Goulden : “Methods of Statistical Aucuy&ia (John Wiley) 1939. 

15 



ment up into blocks. In general experiments done as close together as 
possible in time and space will resemble each other more than those 
remote from them. The air temperature, atmospheric pressure, humidity, 
adjustment of the machines, mood and behaviour of the operators, and 
many other less tangible factors will all tend to vary less within a short 
space of time than over a longer period. Accordingly, if the experiment 
can be so split up into blocks the effective error will be appreciably 
reduced. 

(i) Multiple Factor Experiments 

The type of experiment which we have been discussing up to now has had only 
one “independent variable’’ or “factor.” In much industrial work, of course, 
there are a relatively large number of independent variables which we wish to 
investigate, and we therefore require a further set of techniques. 

The classical ideal of experimentation is to have all the independent variables 
but one constant. It is frequently not recognized that this may sometimes be 
far from the ideal, for in order to get a fair assessment of the effects of varying 
the particular variable one must allow the others to vary over their full range as 
well. If they were held constant, they would have inevitably to be held constant 
at completely arbitrary values. Thus varying factor A from its normal value 
Ai to some other value may produce a given change in the quality of the pro- 
duct when factor B is at a value of Bi but a different change in the quality of the 
product when factor B is at a value Bj. The factorial design of experiment is 
designed to detect this type of effect, and at the same time gives maximum effi- 
ciency, i.e. gives the maximum amount of information about the system being 
experimented upon for a given amount of work. 

The technique of factorial experimentation was developed in the science of 
agriculture in the twenties and thirties largely at the Rothamsted Experiment 
Station by R. A. Fisher and his colleagues. 

Fisher’s approach to experimentation differs in two fundamental aspects 
from the classical one-variable-at-a-time ideology. Firstly, he stresses the 
importance of obtaining an accurate estimate of the magnitude of the error varia- 
tion, rather than its minimisation. The accurate estimate of the error variation 
is necessary in order to apply an exact test of significance. 

Secondly, Fisher emphasises the advantages to be gained from including in 
the same experiment as many as possible of the factors whose effects are to be 
determined. The advantages of this are — 

(a) much greater efficiency ; estimates of a given standard of accuracy for 
the effects can be obtained in a much smaller total number of observa- 
tions. When the experiments are on the full industrial scale this may 
represent a very considerable reduction in their cost ; 

(b) information is given on the extent to which the factors interact, i.e. the 
way in which the effect of one factor is affected by the other factors ; the 
experiments will therefore give a wider inductive basis for any conclusions 
that may be reached. To give an illustration of this, imagine one man 
carrying out a comparison of two catalysts for a contact oleum plant, the 
conversion efficiency being the dependent variable of interest. He would 
hold constant all his other factors, e.g. in particular gas inlet temperature 
and rate of loading. He might conclude from his experiments that 
catalyst A was better than catalyst B. Meanwhile a second experimenter 
could conclude from experiments on inlet gas temperature that 420®C. 
was the best ; the experiments on catalyst may have been made with a 
temperature of 450°C., and hence were invalid. The first experimenter 
could point out that the second experimenter’s work was made with 
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catalyst B and hence his conclusions are not valid. Similarly, a third 
set of experiments on rate of loading might show that the rates chosen 
for the first two sets of experiments were incorrect, and hence the con- 
clusions drawn therefrom are invalid. However, the loading experi- 
ments might have been carried out at the wrong temperature and with 
the wrong catalyst, and hence would not be valid. It is possible, of 
course, that 420 °C. is the best temperature irrespective of which catalyst 
md loading rate was employed, and that catalyst A is the best catalyst 
irrespective of which temperature and loading rates were employed, and 
that the best rate of loading remains the best irrespective of which catalyst 
and temperature are employed. To make these assumptions without 
demonstrating their validity is quite unjustified, however. This type of 
effect, namely the effect of one factor being dependent upon the value of 
another factor, is known as an “interaction.’’ 

(j) The Three Factor Experiment 

(i) Classical 

Let us consider a hypothetical experiment, in which we wish to investigate 
the effects of three independent variables P, Q and R (which may be temperatures, 
pressures, flow rates, concentrations, etc.) upon a dependent variable x (which 
may be yield, purity, etc.). 

Suppose that in the first instance it will be adequate to investigate them each 
at only two levels. Thus suppose the normal values for the process are Pi, Qi, 
and Ri, and we wish to find the effects of increasing them to Pg, Qa and Rg 
respectively. How would we carry out this experiment in the classical style ? 

We would first do an experimental control with values Pi Qi Ri. A value 
of X would be obtained which we denote by (Pi Qi Ri)x. 

To obtain the effect on x of changing P from Pj to Pg (which we will symbol- 
ically denote by (P, — Pg)! we do an experiment with values Pg Qi Ri. A value 
of X will be obtained which we denote by (Pg Qi Ri)x. It is then clear that 
.(Pi - P2)x = (Pi Qi Ri)x - (Pa Qi Ri)x 

Similarly, experiments at Pi Q 2 Ri and Pi Qi Ra give us 
(Qi - Q2 )x = (Pi Qi Ri)x — (Pi Qa Ri)x 
and (Ri Ra)^ == (Pi Qi Ri)x — (Pi Qi Ra)! 

Now a very important point is that each of these experiments would have 
to be repeated not less than once, for without not less than two observations it 
is quite impossible to make any estimate of experimental error. Without such 
an estimate it is quite impossible to say whether any apparent difference between, 
say, (Pi Qi Ri)x and (Pj Qi Ri)x is real or is due to the errors of measurement 
and sampling, etc. We therefore require our four experiments to be repeated 
once, making eight observations in all. Each level of each effect will be given 
by two observations, i.e. we will be comparing the mean of two observations with 
the mean of two observations. 

No information is given by this experiment as to any possible interactions 
between the factors. Thus for example, (Ri — R2)x with P at Pi may well be 
different from (Ri — Rgx) with P at Pg, but our experiment is quite unable to 
detect such an effect. 

(ii) Factorial 

With the factorial design we should carry out experiments at all combinations 
of P, Q, and R, namely (Pi Qi Ri), (P2 Qi Ri), (P 1 Q2 Ri), (P 2 Q2 B-i)? (Li Qi E.2)> 
(P2 Qi R2), (Pi Q2 Rj?), and (P2 Q2 R2)> making a total, of eight observations, the 
same total as in the classical design. 
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It may make it easier to grasp the significance of the above set of combinations 
if we realize that they are the co-ordinates of eight corners of a cube whose axes 
are P, Q, and R (Figure 1 ). 



Having such a picture we can now visualise how we obtain our estimates of 
the three main effects. We obtain our estimate of (P^ — by comparing the 
average of the P.pl^e with the average of the P, plane ; in detail, the average 
/p ''^*^** corresponding set 

(Pj Qi Ri)x, (Pj Qj Ri)x, (Pj Qj (Pj Qj Rj^jj Although the other two 
variables Q and R are varying, they are varying in such a way that they correspond 
m the two sets, and thus to a first approximation their disturbing effects cancel 

rrJH “ R2)x are obtained similarly. 

L ^ advantage of our factorial design is that our main effects are obtained 

as the difference between the mean of one set of four observations and the mean 
of four observations. In the classical design our main effects were 
the differences between means of two observations. We have thus obtained 
double the accuracy for the same number of experiments. If this experiment is 
bemg carried out on the full industrial scale at considerable cost and trouble this 
factor can be of the greatest economic importance. ^ 

The second advantage of the factorial design is that it provides us with an 
estimate of the possible interactions between the main effects. 

Consider our cube again. To obtain the interaction between P and R 
average the front plane with the back plane, i.e. we average our Q. This gives 
us a table as bel^, the superscript Q indicating that figures are averaged over Q 

(Pi Ri)9 (p* Rx)9 


(Pi Ri)9 


(p. Ri)9 


The top line gives us (P^ — P,), at R = R, and the bottom line gives us 
(^1 — rsli at K = Rj. Since each figure m the table is the average of two results, 
we have a valid estimate of error, and can determme whether (P^_p,)Ri^ the 
superscript indicating that the figure is obtainfd^,with R at Rj, is significantly 
not equal to (Pj — P2)R2. If they do differ significantly, then there is an inter- 
action between P and R. The other interactions can be derived .similarly. 
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The computation of the three factor experiment is discussed in Chapter 
XI (d). 

(k) Higher Factorial Experiments 

We have been discussing a three factor experiment, all factors being at two 
levels. We can represent this symbolically as 2 X 2 X 2, or 2^. The extension 
to a fourth factor is obvious, though not capable of a geometrical representation. 

If the fourth factor is represented by S, then we can regard the eight experiments 
so far mentioned as having been carried out with S at its lower value, Si, and to 
introduce the fourth factor S we carry out a similar eight experiments with 
S now at its upper level S 2 . The estimate of the main eifect of S is then given 
by the difference between the average of the first eight with S = Si and the average 
of the second eight with S = Sg. The main efects of the other three factors 
are obtained similarly. There are now six possible first order interactions, taking 
the factors in pairs, namely PQ, QR, PR, PS, QS, RS, and to obtain each of these 
we average over the two factors not mentioned in the interaction, e.g. for PQ we 
average over R and S. 

It will be apparent that the introduction of the fourth factor has improved 
the experiment appreciably. The main effects are given as a result of the com- 
parison of averages of eight results (instead of four) : the first order interactions 
are given as a result of the comparison of averages of four results (instead of two). 
Further, it is now possible to check on the possible existence of what are known as 
second order interactions. As an example of one of these, consider the interaction 
PQ : it is possible that its magnitude may depend on the value of R, being different 
when R = Ri from what it is when R = R 2 . Such an effect we call a second 
order interaction and denote as PQR or P X Q X R. Actually it is found that 
such interactions are symmetrical with respect to three factors. PQR can be 
regarded as the interaction of PQ with R, QR with P, or RP with Q. With four 
factors there are four possible second order interactions, PQR, QRS, RSP and 
PQS. Our four factor experiment will estimate the existence of all these possible 
effects (the procedure is given in Chapter XI (e)). 

It will be noted that the four factor experiment is even more efficient relative 
to the classical design than the three factor experiment, which achieved double 
the accuracy for the main effects. 

The four factor experiment requires 2^ = 16 runs, and its main effects are 
given as the comparison of the average of 8 nms with the average of 8. The 
nearest equivalent classical experiment is to carry out the five combinations 

Pi Qi Ri Si Pa Qi Ri Si 
Pi Qa Ri Si 
Pi Qi 1^2 Si 
Pi Ql P-l ^2 

each three times which would require a total of 15 runs. The main effecte are 
then given as the comparison of averages of 3 runs, which are markedly inferior 
to the factorial experiment. The latter of course also checks on the existence of 
the six possible first order interactions and the four possible second order inter- 
actions. 

It is clear where the immense superiority of the factorial experiment is arising. 
In the classical experiment all the 12 observations on the last four combinations 
are each only being used once, to contribute information on only one effect. Thus 
the three replications of P 2 Q 1 R 1 S 1 contribute information only on the main effect 
of P and add nothing whatever to our knowledge of the other three factors or to 
the various possible interactions. Similarly for PiQ 2 P‘iSi, P 1 Q 1 R 2 S 1 and 
RiQiRjSg. Only the combination of PiQiRiSi is used several times. In the 
factorial experiment, on the other hand, every observation is used many times 
over in a different manner each time. 
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The extension to more than four factors or to factors at more than two levels 
follows naturally. 

Finally, it will have been noted that the larger factorial experiments require 
a fair number of experiments, e.g. a2^or2x2x2x2 experiment requires 
2^ = 16 runs. This number may be larger than can be comfortably put into a 
block. Methods have been worked out which evade these difficuj,ties and enable 
us to put these 16 runs into 2 blocks of 8 runs or into 4 blocks of 4 runs. These 
are discussed in Chapter XIV. 

CHAPTER II 

FUNDAMENTAL STATISTICAL CONCEPTIONS 

(a) Statistical Terminology 

Statistics, like every other science, has its own words and symbols of special 
meaning. These may be confusing to the newcomer, but are almost essential 
for the normal use of its methods. 

We apply our statistical methods to sets of numbers obtained by making 
measurements upon some property of our product or process. The property 
might be yield, purity, or such qualities as the life in days or in tons throughput 
of a piece of plant before failure. Or a machine may be making articles whose 
size, weight, or metallurgical hardness we measure. This variable we refer to 
as the ** dependent variable,’’ and we are frequently interested in the effect upon 
it of the conditions under which it is obtained. These conditions can generally 
be specified in terms of a number of ^‘independent variables,” which we can 
refer to as “factors.” Thus in many chemical processes the conditions can be 
determined by specifying values for the temperature, pressure, time of reaction, 
proportions of reactants. A given value for an independent variable is referred 
to as a “level.” A given observation on a dependent variable is referred to as an 
“individual.” 

An important concept is that of “population,” broadly speaking a large 
number of individuals from a particular source. We frequently wish to estimate 
the mean value of a property of a whole set of individuals from a small number 
of them ; this part withdrawn from the whole we refer to as a “sample.” 

The use of a bar over a symbol indicates that the average is meant — thus 
X is the average of the set of x’s considered. S indicates summation, i.e. addition, 
of the series of terms considered. 

An awkward conception is “degrees of freedom.” It is closely related to the 
number of observations, in general the number of obsen^^ations minus the number 
of constraints imposed on the system. An accurate definition is difficult, how- 
ever, and we will be content with giving an explicit account for each particular 
application. It is interesting to note that rather similar conceptions will be 
.familiar to physical chemists in the application of the phase rule and in the 
quantum mechanics of molecules. 

(b) Probability 

Probability is expressed on the scale 0 to 1, the former implying that an 
event will certainly not occur (or that a hypothesis is not true) and the latter 
implying that the event is certain to occur (or that the hypothesis is definitely 
true). On this scale a value of 0.5 implies that the event is as likely to occur 
as not (or that the hypothesis is as likely to be true as not). 

Frequently in practice this scale of fractions is transformed into one of 
percentages, e.g. a probability of 0.05 is referred to as a 5% probability. 

(c) Populations : Tests of Significance 

“The idea of an infinite population distributed in a frequency distribution 
in respect of one or more characteristics is fundamental to all statistical work.” 
From a limited experience of some function or of individuals of a given type 
“we may obtain some idea of the infinite hypothetical population from which 
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our sample is drawn, and so of the probable nature of future samples to which 
our conclusions are to be applied. If a second sample belies this expectation we 
infer that it is, in the language of statistics, drawn from a different population ; 
that the treatment to which the second sample’’ has been subjected “did m^e a 
material difference . . . Critical tests of this kind may be called tests of signi- 
ficance, and when such tests are available we may discover whether a second 
sample is or is not significantly different from the first” (Fisher, “Statistical 
Methods for Research Workers”). 

The most common type of test has the form in which we assume that the 
means (Student’s t test), the variances (Fisher’s z test), or the fit of the assumed 
distribution to the actual data (Use of x^) are the same for the two samples and 
calculate the probability of getting such results as we did get on the basis of this 
hypothesis being true. So long as the probability is there 

is no strong reason to doubt the hypothesis. When the probability is O.OSnow- 
evef,^irvaiSS*l?*^an^*^^ be obtained once in 20 times if there 

were really no difference, and hence it is generally taken that this indicates that 
the hypothesis is not true. Smaller values of the probability are, of course, an 
even stronger indication that the hypothesis is incorrect. Thus a sraal Loxabability 
corresp onds to a high degree of significance. 

~^{d) Significance Levels 

It is entirely a matter of judgment what probability level we take as being 
significant. For many purposes the 5% level is accepted, but we must realize 
that this means that 1 in every 20 times we will assert that an effect exists when 
it really does not. If this is too frequent an error, then we must adopt a higher 
significance level, say 1%. The trouble now is that we will be ignoring effects 
when they really do exist. The best practical compromise is probably to reg^d 
results at or even just under the 5% level of significance as well worth following 
up. For presentation of final results and conclusions, however, one would prefer 
a higher level. This can be obtained, if the effect really does exist, by obtaining 
more observations. 

Throughout this book we shall use the word “significant” in the sense 
“statistically significant.” “Significant” accordingly will mean merely that the 
data in question is sufficient to establish that such and such an effect exists with 
the stated level of significance : it is not meant to imply that the effect is necessarily 
of great importance from the practical point of view, though of course it might 
be. The converse also holds : an effect may appear on a given set of data to be 
non-significant, but this may be merely because the data is inadequate to show it 
and the effect may be of the greatest practical importance. Oiu: aim generally 
is to give an estimate of each effect together with the so-called fiducial or confidence 
limits within which it probably (with a stated level of probability) lies, though in 
some cases the solution of this problem is by no means simple. 

For example, if an' effect has to be of magnitude 15% to be of practical 
importance, and our data gives us as our estimate of it 5% i 2%, it is clear that 
the effect is statistically significant but not of practical importance. Alternatively 
if our result was 5% ± 10%, then it is not statistically significant on our present"^ 
data but may still be of practical importance. 

(e) Computation 

The application of statistical methods generally involves a certain amount of 
computation. A large part of this is the summing of numbers and of their squares ; 
this can easily be performed with a table of squares (“Barlow’s Tables of Squares, 
etc.” (E. and F. M. Spon) is a very complete set), and with a*simple adding machine. 
For much correlation work on calculating machine (one that will multiply and 
divide) is almost essential. There are a variety of models available ; it can be 
safely said that the better the machine the more accurately and rapidly can the 
work be carried out. An electric machine with automatic division and multi- 
plication is really desirable. 
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(f) Measures of Variability 

In statistical work we are largely concerned with the variability in determina- 
tion of some property of a system, e.g. the variability in yield or in quality of 
some chemical process. 

The variability of a set of observations can be assessed in a variety of ways : — 

(a) The “range,” i.e. the difference between the largest and the smallest of 
the set, generally denoted by the symbol w. 

(b) The “mean deviation,” i.e. the mean of the deviations of the observations 
from their mean. 

(c) The “variance,” i.e. the sum of the squares of the deviations of the 
observations from their mean, divided by one less than the total number 
of observations, generally denoted by the symbol a*. 

Thus for the set of observations 

9, 11, 14, 11, 13, 12, 10, 13, 12, 15 

(a) the range would be 15 — 9 = 6. 

(b) the mean deviation would be obtained by calculating the mean as 
(9 + 11 + 14 . . . + 12 + 15)/10 = 120/10 = 12.0, the deviations as 
(12 — 9) = 3, (12 — 11) = 1, (14 — 12) = 2, etc., and taking the mean 
of these deviations as (3 + 1 + 2 + . . 0/^^ ~ 14/10 = 1.4. 

c) the variance denoted by a* would be defined as 



where x is the mean of the x’s, n is the number of observations, (x — x)* 
represents the square of the deviation of an x from the mean, and the 
summation sign S indicates that (x — x)* is to be obtained from all the 
x’s and summed for them. Thus the variance of the set of numbers 
quoted above would be (the mean x is here equal to 12.0) 

^ (12 ^ 9)* + (12 — 11)* + (14 — 12)* + ...._ 30 ^ 

a - ^ -3.33 

Of these three measures of variability, the range is much used for small 
samples (n not greater than 10) in process control work (the quality control chart, 
discussed in Chapter VIII) on account of its arithmetical simplicity. It is clear 
that it does not utilise the whole of the information from the data, for the detail 
of the intermediate results does not enter into the determination. 

The mean deviation has the virtue of arithmetical simplicity, but its properties are 
such that it is difficult to employ mathematically, and it must be considered obsolete. 

The variance is generally greatly to be preferred as a measure of variability, 
for its properties are well known and tests of significance based upon it are avail- 
able. It also has the valuable property of being additive, i.e. if a process is subject 
to a number of separate variances ori*, a^*, .... due to several independent causes, 
then the total variance cj is numerically equal to their sum, i.e. a-J = 0 ^* 

-f 02 * + a,* -f The converse of this is of particular importance, where 

we analyse the total variance of a system into components attributable to various 
parts of the process. This is discussed at length in Chapter VII. A further 
advantage of the variance as a measure of variability is that it has the maximum 
efficiency, i.e. it extracts more information about the variability from the data 
than the other measures. 

The square root of the variance is termed the “standard deviation” and 
denoted by the symbol o. 

The relation between range and standard deviation is discussed in Chapter 
VIII (e). 

(g) The Calculation of Variances 

It is often troublesome to calculate the mean x and then find the deviation 
of each observation x from it before squaring and summing these deviations. 
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We therefore use the algebraically equivalent formula 
For the above set of numbers 

IP-H 142 + ... + ]/(10 — 1) 

1202 

_ (1470 ^)/9 == (1470 - 1440)/9 = 30/9 = 3.33 

To lighten the arithmetic in calculating variances, it is frequently worth 
while to take an arbitrary zero and transform all the numbers on to the new scale. 
Thus with the above set of numbers we might shift the axis by 9 units so that 


they now become 

0, 2,^ 5, 2, 4, 3, 1, 4, 3, 6. 
The variance of these observations is then 

= [(0“ + 2» + 5» + . . . + 6“ - (Q + 2+5^-^ 


+ 6 )“) 


]/(10-l) 


= (120 — ^/9 = (120 — 90)/9 = 30/9 = 3.33 

as before. Thus shifting the zero has not affected the estimate of the variance. 
It should be clear that this should be so : we are measuring the variabiHtyabout the 
mean, and even if we added 100 to all the observations the variability remains 


the same. 

If the numbers had all been 100 larger, then the sum of squares would have 
been of the order of 100,000, i.e. the arithmetic would have been rather unpleasant, 
and would have been very much lightened by shifting the zero by lOO xmits, 
giving us our first calculation in which the sum of squares was 1470, or by 109 
units, giving us our second calculation in which the sum of squares was only 120. 

The business of shifting the origin is largely a matter of personal taste. It 
has to be done carefully, but when done the subsequent arithmetic is much lighter. 
It has two advantages. Firstly, one will then be working with numbers less th^ 
100, and with a little practice it is easy to carry the squares of these numbers m 
one’s head. Secondly, the squares of two figure numbers carmot have more than 
four figures, and since one has only four fingers on one’s hand this makes entering 
them on an adding machine easier. 


(h) The Definition of Variance 

The definition of variance is formally not 2(x — x)V(n — 1), as quoted 
above, but S(x — x)Vn, and it is quoted as the latter in many statistic^ texts 
In any practical application, however, we should use the formula with (n 1) 
as the denominator. 

The reason for this apparent confusion is as follows : — 

In calculating S(x — x)* we should be calculating the sum of squares of the 
deviations from the true mean of the population from which the sample is drawn. 
We have only an estimate of the mean, the mean of our sample, which will not 
in general be identical with the true mean of the whole population. The s^ 
of squares of the deviations from the mean of the sample will be smaller than 
the sum of the squares of the deviations from the true mean. To see this, suppose 
we have two observations 1 and 3, and that the true population mean is acmally 1 . 
The sum of the squares of the deviations from the true mean is (1 — 1)“ + 

= 4 • working from the true mean we would use n as the divisor to get the esti- 
mate' of the variance as 4/2 = 2. The sum of the squares of the deviations from 
the sample mean is (2 — 1)» -f (3 — 2)* = 2 ; if we used n as the dm^r the 
estimate of the variance is 2/2 = 1, whereas if we use (n 'S the divi^r we 
get 2/(2 — 1) = 2 as our estimate of the variance. It is app-i jnt that the use 
of (n — 1) instead of n as the divisor helps to compensate for the use of the sample 
mean instead of the true mean. 
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The number (n — 1) is known as the degrees of freedom of the variance, 
and is used in making tests of significance in comparing one variance with another 
(Chapter IV). 

(i) Distributions 

It is a matter of common experience that we cannot manufacture articles 
identically ; successive individuals differ slightly in their size, quality, etc. 
Similarly, if we make repeat determinations- of some physico-chemical quantity, 
e.g. the yield from some process, the values we obtain will not be exactly similar : 
this variability would be due not only to the (possible) variability of the process, 
but also to the errors of measurement. 

It can be shown theoretically that if we assume that there is a very large 
number of very small errors all operating independently, then we get a distribution 
of the form shown in figure 2, known as the Gaussian distribution. 

This Gaussian distribution has the following valuable properties : — 

(a) it is completely defined by two statistical parameters, (i) the mean (fixing 
its position along the x-axis) and (ii) the standard deviation (fixing its 
width or spread). 

(b) There is a relation, for which tables are already calculated, between the 
deviation from the mean, expressed as a multiple of a, and the frequency 
with which these deviations occur. Thus to particularise, 95% of the 
individuals should lie between — 1.96 cr and +1.96 a on each side of 
the mean, and 99.8% between — 3.09 <s and + 3.09 a on each side of 
the mean. Similarly, if y/e get a deviation from the mean of d* WiS. cal- 
culate the quahfity known as Student's t defined by d/q, and tables are 
available to show how often a deviation of this size should occur. 

WE DlSTRIBUTiON. 



Figure 2. 
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Figure 3 shows a typical distribution occurring in the chemical industry. It 
refers to the quantity of one ingredient in a four component mixture. The 
lieight of the y-axis at any interval of the x-axis indicates the number of batches 



Figure 3 

which had their content of this ingredient in that interval. It will be seen that 
though the experimental d-istribution does not follow exactly the theoretical 
Gaussian curve, yet its features are very similar. It must be remembered that 
we have here only a total of 138 observations : in any given interval the number 
to be expected is quite small, therefore, and we must expect random fluctuations 
about the expected value. If the total number of obseiwations was 1000 we 
would expect the experimental distribution to be a closer approximation to the 
Gaussian, and with 10,000 observations even closer. 

IMany of the -tests of significance that we make are based on the assumption 
that our variable is distributed according to the Gaussian form. It is true that 
distributions found experimentally often are not exactly of the Gaussian form, 
but they almost invariably approximate reasonably closely to it. It is fortunate 
that the departure from the Gaussian form has to be very severe indeed before 
our tests of significance become appreciably inaccurate. The discrepancies for 
any distribution with a hump somewhere near the middle and tailing off to zero 
either side are quite negligible. 

(j) Grouped Frequency Distributions 

If the number of observations whose standard deviation is to be calculated 
is large, say 40 or more, the labour involved can be reduced by using what is 
called a grouped frequency distribution. 

We will use the data of the previous section as an illustration (see Table 2.1). 

We divide the variable (here percentage of the component) into about 10 to 
20 equal sized groups (column headed X). We count up the number of observa- 
tions occurring in each of these groups (column headed f). This column f is the 
vertical axis in figure 2. We select a new origin somewhere near the middle of 
the range, and lay out a new scale as shown in the column headed x. The 
remainder of the arithmetic should be clear. We obtain cjx in our x units. Since 
1 x-unit equal 0.1 X-unit, gx = 0.182. Since x = 0.59 this means that the 
mean is 0.59 units up from the x-origin, or 0.59 — 0.06 units up from the X-origin. 
The X-origin was at 2.20, hence the mean is at 2.20 + 0.06 = 2.26 X-units. 
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Table 2.1 


X 

f 

X 

fx 

fx® 

1.75—1.84 

1 


—4 

16 

1.85—1.94 

6 

—3 

—18 

54 

1.95—2.04 

8 

—2 

—16 

32 

2.05—2.14 

24 

—1 

—24 

24 

2.15—2.24 

26 

0 

—62 

0 

2.25—2.34 

32 

1 

32 

32 

2.35—2.44 

23 

2 

46 

92 

2.45—2.54 

9 

3 

27 

81 

2.55—2.64 

6 

4 

24 

96 

2.65—2.74 

3 

5 

15 

75 


138 


+144 
— 62 

502 








+ 82 



[502-®]/(138-l) 

= (502 — 48,72)/137 = 453.28/137 
= 3.31 

ax - 1.82 

ax = 0.182 
X = 82/138 = 0,59 
X = 2.20 + 0.59/10 = 2.26 

In our discussion of the properties of the Gaussian distribution it wcs remarked 
that 95% of the total number of observations would lie within ±1.96 a of the 
mean. These limits here are at 2.26 — 1.96 x 0.18 = 1.91 and at 2.26 + 
1.96 X 0.18 == 2.61. Inspecting our table we find that those in the two end 
intervals are 1 and 3, making a total of 4 out of 138, or 2.9%. However, the 
1.85 — 1.94 and 2.55 — 2.64 intervals probably contain some outside the limits 
less than 1.91 and greater than 2.61, so the actual agreement is reasonable. 

Similarly, we would expect very few (of the order of 2 out of 1000) observations 
outside the limits 2.26 — 3.09 X 0.18 = 1.70 and 2.26 + 3.09 X 0.18 = 2.82. 
Inspecting our table we find none of our 138 outside these limits, in good accord 
with this prediction based on the Gaussian distribution. 

(k) Lognormal Distributions 

For certain types of data there are sometimes good reasons for considering 
the use of the logarithm of the variable rather than the variable itself. Thus 
in some cases the larger the value of the variable the more erratic will it become, 
e.g. in corrosion experiments the larger the corrosion the less the agreement there 
is between duplicates. Under these circumstances we would expect a skewed 
distribution with a long tail to the upper end. However, any deviation which 
is proportional to the mean of the unlogged data when expressed logarithmically 
becomes constant. Fi^re 4 gives the distribution of throughputs before failure 
in thousands of tons of a piece of acid plant. Figure 5 is the same data grouped 
according to the logarithm of the throughput. One individual tends to give the 
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Figure 5. 
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CHAPTER III 

SIGNIFICANCE OF MEANS 


(a) Significance of a Single Mean 

Suppose we have a population distributed with standard deviation a about 
its mean X. Suppose we take samples of n individuals at a time. The first 
sample will have a mean Xi, the second sample will have a mean Xj not necessarily 
the same as Xj, and so on. It is a valuable property of these sample means that 
they are distributed about the population mean X with standard deviation (a/ Vh). 

Now if we take an individual at random and find that it lies further from 
the population mean X than 1.96 a then we conclude that this individual is not 
very likely to belong to the population we are considering. In exactly the same 
way if we take a sample of n individuals and find that its mean is more than 
1.96cT/Vh from the population mean X then we conclude that it is not very 
likely that this sample of n individuals was drawn from the population we are 
considering. 

The ratio of the deviation of the mean x of a particular sample of n individuals 
from an expected value E to its standard deviation 0 / Vn is known as Student’s t, 
i.e. 

_ X---E 

a/Vh 

We will deal with a specific example. Two analysts A and B analysed a 
series of chemical mixtures, and obtained the results below for the percentage of 
one ingredient. We would expect the differences between the operators for each 
mixture to average zero if there was no systematic difference between the operators. 

The essential point about this example is that we have ten numbers (the last 
row in Table 3.1) and we wish to test whether their mean differs significantly 
from zero. 

Table 3.1 


Mixture 

1 



B 



7 

8 

9 

10 

Analyst A . , 

7 

9 

8 

10 

8 

11 

9 

8 

9 

8 

Analyst B . . 

11 

7 

10 

10 

9 

10 

10 

. 9 

11 

11 

A — B = x 

—4 

2 

—2 

0 

— 1 

1 

— 1 

— 1 

—2 

—3 


The average value x for A — B is — 1.10, and the expected value E is 0. 

We calculate the standard deviation of these individuals as 
V[[(-4)» + 2 * + . . . + ^ (--4 + 2 + . . . -3) V10]/(10^1)] = 1.79. 

The standard deviation of the mean x is given by cx/Vn = l-^VVlO = 0.56^. 
We then calculate t as 1.10/0.565 = 1.95. 

Now the value we used for a is of course only an estimate of the population 
value. Being based on ten individuals it is not as accurate as if it were based on 
one hundred or one thousand individuals. The table of the significant values of 
t takes this into account (Table I in the Appendix). Examining this Table, we 
see that for infinite degrees of freedom (the degrees of freedom here are numeric- 
ally one less than the numbers of observations) a value of t as large as 1,96 should 
only occur 1 in 20 times (P = 0.05) if the hypothesis being tested is true. Here 
the number of observations is 10, so the degrees of freedom aie 9, For the 
5% level of significance we would need a value of t of 2.26. Our present value 
is 1.95 and this is less than the value for the 5% level but is greater than the value 
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1.83 for the 10% level. The significance of the present result is thus between 
10% and 5%, i.e. our hypothesis that there is no difference between the two 
* analysts is in reasonabre^^accbrd with the data. If our values of t had been a 
little larger, however, we would have had to regard the hypothesis as incorrect 
and had to accept the conclusion that there almost certainly was a genuine difference 
between the analysts. 

(b) Confidence Limits for a Single Mean 

We have shown that the difference of the mean of this quantity (A — B) is not 
significantly different from zero. This attainment of non-significance might be 
due to the quantity not differing from zero, but alternatively it might be due to 
the inadequacy of our data, i.e. our data may be insufficient to give us a reasonably 
accurate estimate of the quantity. ’ We can estimate which of these two alternatives 
is operating as follows. 

We are going to determine the limits between which we can be reasonably 
confident that the true value of the mean, as determined from our sample, lies. 

The first requirement is to decide what we mean by “reasonably confident.” 
We might consider that if we are right 19 out of 20 times (equivalent to 95%) 
we would be satisfied. 

We would then look np in the table of t for 9 degrees of freedom its value 
for the 5% level (95% chance of being right is equivalent to a 5% chance of being 
wrong). Here t is 2.26. Then the limits iL on either side of the sample mean, 
where a = the standard deviation of the population and n is the number of 
individuals in the sample, are 

± L = i: t (<j/ Vn) 

= ± 2.26 X 0.565 = ± 1.28. 

Thus it is reasonably certain (within that chosen probability level) that the true 
value of the mean lies between — 1.10 -f- 1.28 = 0.18 and — 1.10 — 1.28 = — ^2.38, 
These limits are known as 95% confidence limits. (^) 

If 19 out of 20 was not sufficiently certain for us, we might choose 999 out 
of 1000 (99.9%) and find from the table for t the value for 9 degrees of freedom 
and the 0.001 level as 4.78. The limits would then be — 1.10 ± 4.78 X 0.565 
or from — 3.80 to 1.60. 

We now consider the actual width of these limits (the 95% or 99.9%, or 
whatever level we have chosen). They fix the accuracy with which we have 
determined the mean. If the limits are narrow our determination is accurate, 
if they are wide the determination is not very accurate. These considerations 
should indicate whether the non-significant result we obtained was due to the 
real equality of the mean with the expected value or due to the inadequacy of the 
data. Here, where the mean has a 95% chance of lying an^^here between 0.18 
and — 2.38, it would seem that our determination is rather inaccurate.!*) 

Greater accuracy in a mean can be obtained by taking more observations, 
i.e. by increasing n, since the standard deviation of the mean (or standard error) 
is a/Vn. Doubling the number of observations will give limits approximately 

= 0.707 of the present ones. Quadrupling the numbers would give limits 

v2 

— = 0.500 of the present ones. 

V4 

(^) Strictly speaking, these are “fiducial limits” and not confidence limits. For practical 
purposes the distinction is so slight as to be not worth making, and because of its greater 
intelligibility the term confidence limit is used here. 

(®) A more accurate statement would be **If we say that the true population mean lies 
between 0.18 and — 2.38, then the probability of our being correct is 0.95.” 
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(c) Comparison of Two Means 

We obtain two sets of numbers, and wish to test whether the mean of one 
set is significantly different from the mean of the other set, or whether the two 
sets can be regarded as drawn from one population. 

Let Xi, X 2 Xni, and x/, Xa', ’ ^ ^ 1 

be the two samples. The method of treatment depends upon the size ox sample. 
Large samples can be considered to be those in which the total number of individuals 
(ni + na) is greater than 30. 


(i) Small Samples 

We calculate the following terms. 


X 



o“=[s(x'*) 


S(x') 

ns 


(S(x'))» 

na 


S denotes summation. 

2(x») - /(nx + ns - 2) 

til 


t 



s/\ 


n^ X Uj 

ni + na 


Having found the value of t, we enter the table of t with degrees of freedom 
(nj + uj _2). If our value of t exceeds that for the 5% level we may take it 
with that degree of assurance that the populations are different. 

Let us consider a specific example. The numbers below represent the 
throughputs in hundreds of tons of acid before failure for two sets of pots, one 
from Foundry A, the other from Foundry B, used in sulphuric acid pot con- 
centration. Is there any reason to suppose that one foundry is better than the 
Other ? 


Foundry A. 

71, 67, 33, 79, 42 

Foundry B. 

73, 80 


XA = (71 4* + 33 + 79 + 42) / 5 — — — 58.4 

Xb = (73 -b 80)/2 = ^-y-= 76.5 

= [(7p + 67» -1- . . . -I- 42» - 73« -h 80« /(5 -h 2 -2) 

= 1596/5 = 319.2 

o = 17.9 

76.5 — 58.4 , /5 
^ “ 17.9 V 5 4- 2 

= 1.21 for (S + 2 — 2) = 5 degrees of freedom. 

Consulting the table of t, we find that for 5 degrees of freedom t must be 
not less than 2.57 to be significant at the 5% level. Accordingly there is no 
evidence that here the populations are different. 

Looking up this value of t in a more complete table of t than the one in the 
Appendix, we find it corresponds to a Ipel of significance of approximately 25%. 
That is to say, in the absence of any difference between tho two foundries, if we 
repeatedly sampled from them we would get as large an apparent difference 
between them as we did here 1 out of every 4 times. The present result, therefore, 
is only a very slight indication of the possible superiority of Foundry B. 


30 




We have shown that the difference in these means is not significantly greater 
than zero. We can further lay down the confidence limits within which the 
difference will have a chosen probability of lying. We proceed on the same lines 
as before. 

We have that the standard error of the difference in the means of the two 
samples, denoted by am, is 


am 




ni + nj 


Hi X nj 


17.9 X 




3 + 2 
5x2 


= 14.9 


We also have t = its value in the table of t corresponding to (nj +,n 2 — ^2) = 
(5 2 — 2) = 5 degrees of freedom and to the significance level chosen. If we 

select the 5% level of significance (95% chance of being correct in the prediction 
that the true value of the difference in the means of the two samples lies between 
the limits stated) then for 5 degrees of freedom, t = 2.57. 

Then if ± L is the maximum deviation likely (at this significance level) we 

have 

L = t am = 2.57 X 14.9 = 38.3 

The actual mean value obtained for the difference between the two types of 
pot was 18.1 The true value will lie 95% of cases ± 38.3 on either side of this, 
i.e. from — ^20.2 to + 56.4 We thus see what a poor comparison is afforded by 
the data cited. There may well be marked differences between the foundries, 
but the present data is quite inadequate to answer these queries. 


(ii) Large Samples 

For large samples the same treatment as for small samples holds, but it can 
be simplified since we do not need to stick to the use of (n — 1) instead of n. 
We calculate the means of the two samples. 

We further calculate the standard deviations ai and of the individuals of 
the two samples. The standard error of the difference in the means of the two 
samples x — x' is then given by 
2 _ , <J2* 

CT = j- 

D ni na 
As before, we find t as 

t = (x — x')/aD 

and enter the table of t with infinite degrees of freedom (the last row m the table). 
Thus if X = 100, Ui = 40, a^® = 120 

X = 110, na = 30, aa* = 60 


then 

and 


OD == 


t 




40 30 


= V5 = 2.236 
= (110 — 100)/2.236 
= 4.47 


This is very much more significant than the 0.1% level of significance, so it 
may be taken as certain that the two means d o differ. 

To lay down the confidence limits + L between which the true value of the 
difference between the two means has, say, a 95% chance of lying, we look up 
t for infinite degrees of freedom for this level, and the limits are (x — • x') ± t od, 
or here 10 ± 1.96 X 2.24 or between 5.6 and 14.4. 


(d) Conclusions 

In general, therefore, in any account of experimental work involving Ae 
determination of a mean in addition to the value obtained, its standard deviation 
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and the number of observations upon which it is based should also be given. 
This will enable readers to assess the level of significance of the result. 

An important proviso in the calculation of a standard deviation is that the 
individuals should be independent. Thus if we made 10 batches of some material, 
and took duplicate samples from each batch, we could not calculate the standard 
deviation on the assumption that we made 20 observations. Clearly the duplicate 
observations on each batch are not independent of each other. The methods of 
dealing with this type of situation are discussed later (Chapter XII (j)). 


CHAPTER IV 

COMPARISON OF VARIANCES 

(a) Comparison of Two Variances 

When we wish to compare the means of two groups of numbers we use the 
Student t test. It is sometimes the case, however, that what we wish to compare 
is the spreads or variabilities of the two sets of figures, and this is a different 
problem. 

The test of significance for this type of problem is due to Fisher (Fisher 
actually dealt with the natural logarithm of the ratio of the square roots of the 
variances, which he called z, but here we will use the simple variance ratio which 
is denoted by F). 

To deal with an example, suppose we have two alternative methods of tech- 
nical chemical analysis. One or both of the methods may have a systematic bias, 
but that does not matter, for all we require is a method that will give a reproducible 
result. 

Table 4.1 gives the results of 6 analyses by Method A on a sample and 7 
analyses by Method B on the same sample. 

Table 4.1 


Method A. 95.6 94.9 96.2 95.1 95.8 96.3 

Method B. 93.3 92.1 94.7 90.1 95.6 90.0 94.7 


The averages for Methods A and B are 95.65 and 92.93 respectively. If we 
wished to test the significance of this apparent difference we would use the 
Student t test. Here we are comparing the variances, which we proceed to 

= [(5.6» + 4.9« + . . . + 6.3>) - + - 1) 

= (193.15 — 191.53)/5 = 0.324 

Ob* = [(3.3» + 2.1« + . . . + 4.7») — + • • • + — 1). 

= (90.85 — 60.04)/6 = 5.14. 

We now calculate the ratio of the larger variance to the smaller, termed the 
variance ratio or F. Attached to the variance ratio are two sets of degrees of 
freedom, nj for the larger variance and n* for the smaller. 

Here the variance ratio is 5.14/0.324 == 15.9 for degrees of freedom ni = 6, 
02 — 5. In the Appendix are tables for the variance ratio for a comprehensive 
series of values of ni and for the 20%, 5%, 1% and 0.1% levels of significance. 
If our value of the variance ratio is greater than that given in a particular table 
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for the corresponding degrees of freedom, then our result is more significant than 
the significance level for the table. 

Thus for degrees of freedom ni = 6, Ug = 5, for the 20%, 5%, 1% and 
0.1% levels of significance the values of the variance ratio are 2.2, 5.0, 10.7 and 
28.8 respectively. Our value of the variance ratio was 15.9 : its significance thus 
lies between the 1% and the 0.1% levels (but see below). 

That is to say, assuming our hypothesis to be correct we would be very 
unlikely to get the result we did. Accordingly it is very probable that our hypo- 
thesis is incorrect. Our hypothesi s was that th e two sets .^of figures were drawn 
from the, same the variabilities of the two methods were the same. 

Pur hypothesis having been proved incorrect, we can assert that the variabilities 
of the two methods do differ significantly. 

A point of some subtlety arises here. The variance tables were constructed 
for the purpose of testing whether a variance A was greater than a variance B, 
A being known beforehand as derived in a particular way which distinguished it 
from B. Here we are testing whether a variance A is greater than a variance A^, 
and until we have calculated them we do not know which is A and which is A^ : 
A is that one which turns out to be the larger. The effect of this distinction is 
to alter the level of significance of the 20%, 5%, 1%, and 0.1% tables to 40%, 
10%, 2%, and 0.2% respectively. 

(b) Averaging of Several Variances 

Suppose, extending our previous example, we had a number of determinations 
of variability for each method on a number of different samples. Suppose a 
second sample analysed by Method B gave results 93.1, 91.2 and 92.6. Its variance 
would then (using the revised zero by subtracting 90 from every observation) be 
[3.1* + 1.2* + 2.6* — (3.1’ + 1.2 + 2.6)*/3]/(3 — 1) = (17.81 — 6.9*/3)/2 
= 1.94/2 = 0.97. 

To form the average of the two variances, we weight them in accordance 
with their degrees of freedom. Thus the average would be 

2 X 0.97 4- 6 X 5.14 
2 + 6 

This average is the same as obtained by pooling the sums of squares and 
degrees of freedom for the two samples, i.e. (30.82 + 1.94)/(6 + 2).^ 

This is the correct method for obtaining an average variance. It is incorrect 

(a) to take the average of the variances without regard for the degrees of 
freedom ; 

(b) to take the average of the standard deviations. 

If we have the variances expressed as standard deviations, we should square 
them to convert them to variances, and take the average as indicated. 

The average variance, correctly obtained, can be regarded as having as its 
degrees of freedom the sum of the degrees of freedom of the separate variances 
going to make up that average. Thus in the present example the degrees of 
freedom of the average variance are (6 + 2) = 8. 

In parenthesis, it might be pointed out that in the calculation of the average 
variance of the two samples, the two variances must be calculated separately and 
then pooled. It would be incorrect to pool all the 7 + 3 = 10 observatiors 
together and calculate their variance, because the variance so obtained includes 
not only the variance of repeat measurements on the same sample (which is what 
we require) but also the variance between samples, which is quite irrelevant. 

(c) Comparison of Several Variances 

The Fisher variance ratio test will compare two variances, but not more 
than two. Suppose we have a group of ten machines turning out batches of 
some product, and we measure some quality x on each batch. Suppose we suspect 
that some machines manufacture the product more regularly than others, i.e. that 
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there is less variability in their product. The test of significance for this type of 
problem is generally l^own as Bartlett’s test. 

Column 2 in the table below represents the value of the property x of six 
consecutive batches off machines 1 to 10. The figures in column 2 actually are 
transformed by shifting the zero to simplify the arithmetic : originally each had 
2200 added on to them. 


Machine 

Data 

S(x) 

S(x») 

(S(x))V6 



l0ge(o*i) 

HI 

28, 14, IS, 2, 9, 2, 


1294 

816.7 

477.3 

95.46 

4.5587 


11. 3. 0. 9. 2, 12 

37 

359 

228.2 

130.8 

26.16 

3.2642 



6 

52 

6.0 

46.0 

9.20 

2.2192 


8, 0, 3, —1, 6, —1 

15 

111 

37.5 

73.5 

14.70 

2.6878 ' 


10, 5, 3, 0, 0, 1 

19 

135 


74.8 

14.96 

2.7054 


4, 8, 7, 0, —1, 1 

19 

131 


70.8 

14.16 

2.6502 

7 

li, 2, 3, h, lb, 0 

37 

355 

228.2 

126.8 

25.36 

3.2331 


15, 10, 0, 8, 10, 10 

53 

589 

468.2 

120.8 

24.16 

3.1847 


7, 7, 4, 9, 5, 6 

38 

256 


15.3 

3.06 

1.1184 


lb, 2,— 1,10, —1,7 

27 

255 

121.5 

133.5 

26.70 

3.2847 

B 






253.92 

28.8906 


The columns headed Il(x), S(x®), (S(x))V6 and S(x — x)® where S denotes 
summation, are merely the steps in c^culating the variances for the ten machines. 

The latter are given in the column headed a]. 

We now wish to test the hypothesis that these ten variances could reasonably 
have been obtained from the same population, i.e. that the batches from the ten 
machines are of the same variability. 

We proceed by obtaining (from tables) the natural logarithm of each variance 
(last column), and summing these. Let us denote this sum by S(lagc«yi)- 
take the mean of the variances as 253.92/10 = 25.392 : we can denote this mean 
variance as S*. We take its natural logarithm logeS®, here 3.2343. Let k equal 
the number of variances being compared (here 10) and n equal the degrees of 
freedom of these individual variances (here 5). Then we calculate two terms 
B and C as 

B = k n logeS^ — n S(logeai) 

= 10 X 5 X 3.2343 — 5 x 28.8906 = 17.18 

and C = 1 + 

3 n k 


= 1 + 


10 + 1 

3 X 10 X 5 


1.073 


If B/C is greater than significance for (k — 1) degrees 

of freedom, then there is an indication of that level of significance that the variances 
are not drawn from a homogeneous population. 

Here B/C= 17.18/1.073 = 16.0 with (10 — 1) = 9 degrees of freedom. 
For 9 degrees of freedom for the 10%, 5% and 2% levels of significance or 
here B/C, needs to reach values of 14.68, 16.92 and 19.68 respectively. Here 
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our present value does not reach the 5% level, but is near it, and there is some 
indication that the machines do differ significantly in the variability of their 
product. 

It should be clear that if the largest and smallest variances of a set do not 
differ significantly as tested by the variance ratio test, then clearly those lying 
between cannot differ significantly either, and the whole set can be reasonably 
regarded as coming from a single population. In these circumstances there would 
be no need to employ the Bartlett Test. 

A more general form of Bartlett’s test, for the case where the several variances 
being compared have different degrees of freedom, is available, but will not be 
discussed here. 


(d) Confidence Limits for Variances 

In the same way as we laid down confidence limits for means, we can lay 
down confidence limits for variances. 


(i) Small Samples 

If we wish to calculate the probable upper limit for the true variance (say 9), 
at say the 5% level of significance, we cdculate 



where a® is the variance as calculated from our sample with n degrees of freedom, 
and X* lias its value corresponding to n degrees of freedom and the 5% level of 
significance. 


Thus suppose we have a variance of 10.0 units, based on a sample of ten 
individuals (nine degrees of freedom), then for the 5% level of significance 
5^* = 16.92 and 

~ 16^ ^ 

This is the lower limit. The corresponding upper limit is given by the 95% level 
of x“i for which x* for 9 degrees of freedom is 3.32, whence 


Regarding these limits together, we have a 5% probability of the trae population 
variance being less than 5.31 and a 5% probability of its exceeding 27.1 The 
total probability of the true value being outside these limits is thus (5% + 5%) 
= 10 %. 


Speaking generally, therefore, our confidence limits should be regarded as 
10% and not 5% ; if we wish to have limits nearer 5% we can use the 98% and 
2% levels of x* giving us confidence limits of 4%. 


(ii) Large Samples 

For larger sample (greater than 30 individuals) we can use the fact that the 
standard deviation of a standard deviation a is where n is the number 
of observations. 

Thus if we have a sample of 50 individuals whose standard de viation i s 10.0, 
the standard deviation of the sample standard deviation is 10.0/ V'2 X 50 =1.0. 
The 95% confidence limits are thus 10.0 ± 2.01 X 1.0 (2.01 is the value of t for 
the 5% level of significance and 50 degrees of freedom). 

35 



CHAPTER V 


THE X* test 


(a) Introduction 

We now turn to a different type of problem which requires a technique 
different from those previously discussed. The x® test is applied in general to 
those problems in which we wish to determine whether the frequency with which 
a given event has occurred is significantly different from that which we expected. 

The technique is applied, therefore, to such phenomena as the occurrence 
of heads or tails in the spinning of a coin : the frequency of bursting of drums 
in transit : the numbers of accidents on different shifts : the number of defective 
castings made by different methods. 

To apply the method we calculate the expected frequencies of the events on 
the basis of the theory we are testing, and we then test whether the observed 
frequency differs significantly from the expected. If it does, of course, then our 
theory is unlikely to be correct : if it does not, then our theory is in reasonable 
accordance with the data, and the differences can be due to random error. 

The statistic x* is defined as 

where O is the observed class frequency, E is the expected class frequency, and 
S denotes summation over all the classes. 


In general, it is inadvisable to use the x* test where any expected class 
frequency is less than 5, This point is discussed later in Chapter V (h). 

We will now discuss a series of examples. 


(b) The 1 X 2 Table 

Suppose we tossed a coin 100 times, and we obtained 60 heads and 40 tails, 
Could we say that the coin was definitely biassed ? 

Here we have two classes, heads and tails, and on the hypothesis which we 
are testing the expectation E in each class is 50. 

We have to make a correction for the fact that our actual distribution is 
discontinuous whereas x* is a continuous variable. This correction for con- 
tinuity, as it is called, consists in reducing by 0.5 the values which are greater 
than expectation and increasing by 0.5 those which are less. We thus have 
(59.5 -^50) « (40.5 ^50)» 

^ 50 50 

= 3.61 

The appropriate number of degrees of freedom is found as the number of 
classes to which values can be assigned arbitrarily. Here, given the total, wo 
can only fill one class arbitrarily, because when that is done the second class is 
automatically determined, i.e. if we make 100 tosses and obtain 60 heads, then 
the number of tails must be 40. Accordingly, entering the table of x* with one 
degree of freedom, we. see that to reach the 5% level of signifiance x® would l^iaye 
to be equal to oi greater than 3,84. Hence our result does not reach significance 
at the 5% level, and so we cannot assert that the coin is definitely biassed. 

It might perhaps be stressed that the probability we have derived is not that of 
getting 60 heads exactly, but that of getting either 60 or more heads or 60 or more 
tails. The probability of getting exactly 60 heads with an unbiassed coin is 
100 1 1 

60 I 40 I ' “2^ ~ 0.010843 867 

(this at first sight rather formidable calculation is made easy by the tables of 
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powers and of factorials contained in Barlow’s Tables of Squares, etc. (E. & F. N. 
Spon)). The probability of getting exactly 61 heads is 

^ = 0.007110 733. 


61 ! 39 ! 


2100 


Proceeding in this manner, we can get the series for 62, 63, etc., heads, and sum- 
ming this series we get for the sum of probabilities 0.028 443 968. This is the 
probability of getting 60 or more heads. Adding the similar probability of getting 
60 or more tails, the total probability is 0.056 887 436, or 5.69%. 

The test gives as the probability, using a detailed table of for 1 degree 
of freedom(^), as 5.8%. The two methods are thus in close agreement. 


(c) The X* Table 

It will be noted that the x® Table (Table II in the Appendix) is given for 
values of P greater than 0.10, namely 0.50, and 0.90, 0.95, 0.98, and 0.99. 

The latter values are those which occur when a fit is better than it should 
be. It may seem surprising that this could be so. However, consider the case 
of a trial of 1,000 tosses to decide whether a coin is biassed. Our expectation 
for each of the two classes are 500. If we got exactly 500 of each in our trial, 
this agreement would be remarkable. Thus suppose we repeated the experiment 
a large number of times, then in general we should expect to get observed v^ues 
of the frequencies corresponding to values of x* corresponding to values of P 
scattered over the range 0.95 to 0.05, with 1 out of 40 outside the upper limit 
and 1 out of 40 outside the lower limit. The value of x* which turns up most 
frequently is that corresponding to P = 0.50. Close fits should only occur 'mth 
that frequency given the x* test, and if they occurred more frequently we might 
reasonably suspect that the data was being “cooked.” 

(d) The 1 X n Table 

An example of a 1 x n table, in which n = 3, is below. 


Shift 

Accidents in a particular period 

A 

1 

B 

7 

C 

7 


We formulate the hypothesis that there is no association of number of acci- 
dents with shift, and proceed to test it with the x* test.^ On our hypothesis the 
expectations for the three classes are 5, 5, and 5 respectively. 

... ^ 4 . (ij=jy + 

^ ^ 5 5 5 

The degrees of freedom are two (given the total, we can fill m two classes 
arbitrarily but then the third class is uniquely determined). Entering x* 
table, we find that for 2 degrees of freedom, to reach the 5% level of sig^ficance 
X* has to be equal to or greater than 5.99. Since our value of x^ is less tl^ 
that, our hypothesis is reasonable, i.e. there are insufficient grounds for. asserting 
that’ the three shifts differ in their liability to accidents. _ v 

This result will probably be surprising to the non-statistician, who almost 
certainly would have assumed that it was qmte certain that the three shifts did 
differ It is important to note that the statistician does not assert that the three 
shifts do not differ : he merely states that the evidence that they differ is inadequate. 

(1) Such as that in “An Introduction to the Theory of Statistics’’ : G. U. Yule and 
M. G. Kendall (Charles Griffin), Appendix Table 4a. 
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Of course, we might formulate the hypothesis that the accidents are in the ratio 
1:7:7; would then be zero, i.e. there is no evidence that the accidents depart 
from the ratio 1:7:7. It is thus apparent that the data is .simply inadequate to 
draw any conclusion. 

Suppose we waited for a further period, and the accidents in the total period 
were 2, 14, and 14 respectively. The table would then become 


Shift 

Accidents in total period 

A 

2 

B 

14 

C 

14 


Again testing our hypothesis that there is no association of accidents with 
shift, the total number of accidents is 30, and on this hypothesis the expectation 
E for each category is 10. 

Accordingly 

^ (2-ioy , (14-10)^ , (14-10)^ 

^ 10 10 10 
=:: 9.60 

Such a value of x* with 2 degrees of freedom is more significant than the 
1% levels i.e. on our hypothesis such a value would have been obtained less than 
1 in 100 times. Thus the additional evidence has now made it very probable 
that the shifts do differ in their liability to accident. 

Table 5.1 gives some data obtained on a process for beating paper in a beater. 
Four alternative methods of loading the beater were tried, the numbers of cycles 
carried out for the four cycles being given by the first row in Table 5.1. The 
second row gives the number of chokes of paper between the bed-plate and the 
roll that occurred. 


Table 5.1 


Method of Loading 

A 

B 

C 

D 

Total 

Number of cycles 

8 

10 

9 

13 

40 

Chokes occurring 

5 

8 

9 

10 

32 

Chokes expected 

6.4 

8.0 

7.2 

10.4 

32 


We wish to test the hypothesis that there is no association of frequency of 
choking with method of loading, i.e that frequency of choking is independent of 
method of loading. On this hypothesis the expectation of a choke, the same for 
all methods, is given by 32/4<) = 0.80 chokes per cycle. The third row in 
Table 5.1 gives the expectation for the four methods. We have therefore 

, 1.4® , 0® , 1.8® , 0.4® 

^ 6.4 8.0 7.2 10.4 

= 0.77 

We have used one degree of freedom in making the restriction that our expected 
total is equal to the actual total. (If we had an a priori expectation, e.g. if we 
had the Iiypothesis that there was one choke per cycle, then we wouldn’t have 
lost this degree of freedom). Entering the table for x** with 3 degrees of freedom, 
we see that to reach the 5% level of significance we would have to have a x* of 
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7.82. It is clear that the present data is in excellent accord with the hypothesis 
that the frequency of choking is independent of the method of loading. 

It will be noted that Table 4 at first sight is a 2 X n table (see Section (d)). 
In a 2 X n table the frequencies being compared relate to the same general class 
of events. Here, however, the occurrence of a cycle is not of the same class as 
the occurrence of a choke. Accordingly we are concerned with the 1 X n table 
of chokes occurring and only use the other row to derive our expected frequencies. 

(e) The 2x2 Table 

A problem frequently occurring can be reduced to the following form : — 

One type of process produces (a) defectives in a sample of (a + b) individuals : 
another type of process produces (c) defectives in a sample of (c + d) individuals. 
Are the two populations the same or different ? 

This type of problem can occur, for example, in cordite manufacture, where 
the detectives might be sticks rejected for e.g. surface blemishes, and we are 
interested in whether e.g., one method of packing the press cylinder gives less 
rejects than another. 

The problem is formally exactly the same when articles are being tested with 
go-not go gauges instead of continuously’ measuring gauges ; if an article fails to 
pass on one or the other of the gauges it is a defective, if it passes the gauges it 
is a non-defective. 

To take an example, suppose that in our first sample of 1000 there were 
100 defectives (fraction defective 10%) and in the second sample of 500 there 
were 60 defectives (fraction defective 12%). Are we justified in taking it as 
reasonably certain that the populations represented by these two samples are 
different ? 

Our contingency table then has the form 



Defective 

Non-Defective 

Totals 

Process A 
Process B 

a= 100 
c= 60 - 

b = 900 
d = 440 

a + b = 1000 
c + d= 500 

Totals 

a c = 160 

b + d = 1340 

1500 


We must calculate the expectation for each cell : it is directly proportional 
to the appropriate row and colunm totals, and inversely proportion^ to the grand 
total, i.e. the expectations for a, b, c and d are 
160 X 1000 ^ 


1500 
1340 X 1000 


Accordingly 


1500 


= 893.3, 


160 X 500 


6.7* 


+ 


6.7» 

893.3 


+ 


1500 

6 . 7 ^ 

53.3 


+ 


= 53.3, and 


6.7> 


1340 X 500 
1500 


= 446.7 


446.7 


^ " 106.7 
= 1.4 

For one degree of freedom (given the marginal totals, we can only fill in 
one cell arbitrarily, for then the others are uniquely determined) this is about 
the 20% level of significance, i.e. the apparent superiority of process A is not 

established by this experiment. . 

We omitted in the above example to apply the correction for continuity . 
with large numbers such as are in the above example this is permissible, for the 
resulting difference is, only slight. 
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In the above example we calculated the cell expectations separately, and 
summed the series of terms for x®* There is, however, a convenient mechanical 
method, which merely involves calculating 

^ (ad — bc)^ X N 

^ ~ (a + b) (b + d) (c + d) (a + c) 

where a, b, c, and d have the meanings as in the contingency table quoted above 
and N — a + b + c+d. Thus in the above example 
, (100 X 440 — 900 X 60)2 x 1500 

^ ~ 160 X 1340 X 500 X 1000 

= 1.4 

as before. 

With small sample sizes it is essential to apply the correction for continuity, 
as in the following example. 

A pressing process using one batch of powder gave 9 defective pellets out 
of 45 pressed (20% defective) and with a second batch of powder 29 defectives 
out of 87 pressed (33% defective). 



Batch A 

Batch B 

Totals 

Non-Defective 

58 = a 

36 = b 

94 

Defective 

29 = c 

9 = d 

38 

Totals 

87 

45 

132 


As before, the method of correcting for continuity is to add J to the observa- 
tion in those cells which are below expectation and subtract J from those above 
expectation. The marginal totals are, of course, unchanged by this operation. 

, (58.5 X 9.5 — 28.5 x 35.5)» x 132 , . . 

= 94 x45 xl8 x 87 = 

For 1 degree of freedom this is quite non-significant. If we had neglected 
to apply the correction for continuity we would have obtained a value for x* of 
2.57, in this case still non-significant. In borderline cases, however, the difference 
may be important(2). 

(f) The 2 X n Table 

The data in Table 5.2 below refers to two types of incident, two certain types 
of breakdown, A and B, which can occur in a certain process. The process can 
be run on three grades of raw material, L, M, and N. 


Table 5.2 



L 

M 

N 

Totals 

A 

42 

13 

33 

88 

B 

20 

8 

25 

53 

Totals 

62 

21 

58 

141 


(*) An intere.sting discussion of how faulty conclusions can easily be i cached from 2x2 
table is by J. Berkson on hospital data : Biometrics Bulletin, Vol. 2, No. 3, page 47, 1946. 
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We form the hypothesis that there is no association of type of incident with 
raw material. In other words, that the relative likelihood of incidents is the same 
for the three types of raw material. We proceed to test this hypothesis with the 
X® test. 

The expectation for the cell L A is 62 X 88/141 = 38.7 and the contribution 
to X® is (42 — 38.7)®/38.7 = 0.28. Similarly, the expectation for the cell M A 
is 21 X 88/141, etc. 

The degrees of freedom for ni X ng tables are (n^ — 1) x (n, — 1), or here 
(2 — 1) (3 — 1) = 2. The value of x* comes out, summing for all six cells, as 
1.5, which is very much less significant than the 5% level. Accordingly we can 
conclude that the data is in excellent accord with our hypothesis. 

Non-statistically, we would have calculated the ratios A/B for raw material 
classes L, M, and N as 42/20 = 2.10, 13/8 = 1.63, and 33/25 = 1.32. We 
would probably have concluded that these ratios were significantly difiFerent. The 
use of the x® test, however, shows that such a conclusion would be unwarranted. 

As for a 2 X 2 table, so for a 2 x n there is a convenient mechanical method 
for computing x®* Suppose the frequencies in pairs of cells are a and A, b and B, 
etc., and the corresponding totals n and N. Then we calculate for each pair 
p = a/(a + A) and 
ap = a2/(a + A) 
and from the totals 

P = n/(n + N) 

and then 

X* = p^p ]_ P (ap) — nP] 

Our three pairs give as values of ap 

42V62 = 28.452, 13 V21 = 8.048, and 33V58 = 18.776 
2 (ap) is then 55.276 

and P = 88/141 = 0.624 
and nP = 88V141 = 54.972 

Whence x* = (55.276 — 54.972)/0.624 x 0.376 = 1.51 as before. 

(g) The Hi X n, Table 

An Hi X nj table, with n^ = 3, n, = 6 is given below. 



a 

b 

c 

d 

e 

f 

Total 

A 

11 

23 

8 

5 

18 

'l8 

83 

B 

17 

29 

IP 

17 

7 

15 

95 

C 

6 

21 

8 

24 

15 

9 

83 

Totals 

34 

73 

26 

46 

40 

.42 

261 


A product was being produced on three shifts. A, B, and C and was classified 
into six grades, the order of quality being a to f. 

On the average each of the three shifts would be supplied with raw material 
of the same average quality. The question arose as to whether any shift was 
tending to produce better quality material than the others, i.e. was there a relative 
excess of particular qualities on any one shift. 

To test the hypothesis that there is no association of quality with shift, we 
calculate x® for table. The expectation of the cell Aa is 83 X 34/261 = 10.8, 
and the contribution to x® is (H — 10.8)V10.8 = 0.004. Similarly the expect- 
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ation for the cell Ba is 95 X 34/261 = 12.38, and the contribution to x* is 
(17 -- 12.38)V12.38 1.72. 

Performing the operation for all the cells and summing all the contributions, 
we get' for x^ the value 26.3. The degrees of freedom with which we enter the 
X* table is given by the number of independent ways the contingency table can be 
filled up, given the marginal totals. For an ni X n 2 contingency table this is 
(ni — 1) (n 2 — 1), here 5 X 2 = 10. Entering the x^ table for 10 degrees of 
freedom with the value 26.3 for x® we have the probability of getting so large 
a value as less than 0.01 (i.e. 1%). Accordingly the hypothesis that there is no 
association of grade with shift would seem to be incorrect, i.e. there is an association 
of quality and shift. 

The x^ test, should it give significance to the association between the two 
variables, does not give any information as to the type of association. This can 
only be found by inspection of the original contingency table. To assist in this 
it is convenient to enter in each cell, alongside the observed frequency, the expected 
frequency. Doing this for the present example, we find that A shift were 
producing a relative excess of e and f, B shift of a and C shift of d. 

(h) Restriction of Expected Cell Frequency to be not less than 5 

As mentioned at the beginning of this discussion of x“*» h is not safe to use 
the X® test where any expected class frequency is less than 5. Sometimes this 
means that the test cannot be applied. On other occasions the difficulty can be 
evaded. 

Consider the table below : 



Li 

L. 

M 

Ni 

N, 

Total 

A 

5 

37 

13 

28 

5 

88 

B 

3 

17 

8 

20 

5 

53 

Totals 

8 

54 

21 

48 

10 

141 


It represents the data from which Table S in section (f) was built up. The 
expectations for cells LiB, and N^B are both less than 5. As the table stands, 
therefore, the x* test cannot be applied. However, if we pool the data in columns 
Li and and in columns Nj and Nj, we get the 2x3 table quoted in section (f), 
Table 5,2, which is perfectly satisfactory. 


CHAPTER VI 

THE POISSON DISTRIBUTION 
(a) Introduction 

Consider a manufacturing process in which an operation is being repeated 
a great many times each day, and occasionally an inflammation occurs. If we 
set out in time sequence the daily number of inflammations we may get such a 
set of figures as 

00100203000130140002 

Some days will be without any, others will have a relatively large number. The 
question we will discuss is whether the probability of obtaining an inflammation 
is constant throughout. Prime facie we might consider that the days with 0 or 1 
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incidents have low probabilities of obtaining an inflammation and the days with 
3, 4 or more high probabilities. If this were the case, then we could look for 
assignable causes of variation in the probability, i.e. find out in‘ what way the 
days with high numbers of inflanimations differed from days with low numbers, 
with a view to correcting the manufacturing practice. On the other hand, if the 
probability were really constant throughout we should be wasting our time trying 
to find out in what ways the days differed. 

If we assume that the probability of obtaining an incident at any one instant 
is small but finite and constant, then over a finite period the chance of obtaining 
an incident will be appreciable. The incident can be of any type (we are con- 
sidering spontaneous ignitions in a manufacturing process) and can include the 
occurrence of defective articles in their mass production. In this latter case we 
must pay due regard to the proviso that the probability of obtaining an incident 
at any one instant (in the terminology of this example of obtaining a defective on 
selecting any particular article) is small. In practice the approximation is good 
enough if the percentage defective is less than 10%. 

(b) Number of Incidents per Interval 

To revert to our example of spontaneous inflammation, let us count up the 
number of days on which no incident occurs, on which 1 incident occurs, on 
which 2 incidents occur, etc. Then if our hypothesis is correct, the relative 

® fiequencies will be as in Table 6.1 below (the e is the base of natural logarithms, 
s the average frequency, and e.g. 4 ! is factorial 4, i.e. 4 X 3 x 2 x 1). ' 

Table 6.1 


Number of 
Incidents 
in one day 

Relative 

Frequency 

0 

e-m — 

1 

me-m/l 1 

2 

' m*e-m/2 ! 

3 

m*e-m/3 1 

4 

m*e-m/4 I 

etc. 

etc. 


This series of terms is known as the Poisson series. 

The sum of the Relative Frequencies is unity, for it is 

e~m + m erm/l ! -f m^e-m/2 I + m®e-m/3 ! + 

= e-m [1 + m/1 ! + mV2 ! + m®/3 ! + ] 

= Orta X em = 1 

The first two columns .in Table 6.2 summarises the results observed on 201 
days working of an industrial process. The incident was an inflammation that 
would occur in a repetition process that was being repeated a large number of 
times each day. Over this period of 201 days there were 150 incidents, and they 
were distributed as in Table 6.2 Thus there were 102 days without any incidents, 
59 days with 1 incident, 31 days with 2 incidents, etc. The total number of 
days is given by 

102 + 59 + 31 + 8 + 0 + 1 == 201 
and the total number of incidents is given by 

102 x0 + 59 xl + 31x2 + 8x3 + 0x4+1x5=^ 150 
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Table 6.2 


Number of 

Number of 

Expectation 

Incidents 

Occurrences 

of 


on one Day 

of this type 

Occurrences 

0 

102 


95.3 


1 

59 


71.1 


2 

31 


26.5 


3 

8' 


6.6" 


4 

5 

0 

1 


1.3 

0.2 

.8.1 

6 

0. 


oj 


Total 

201 

201.0 


• We have 150 incidents in 201 days, so the daily average frequency m is 0.7463, 
A table of natural logarithms gives us em as 2.109 : e^m is the reciprocal of this, 
namely 0.4742. The initial term of the series, corresponding to 0 incidents in 
a day is Ne-m == 201 X 0,474 = 95.3. The next term is mNe-m/1 ! = 
0,7463 X 95.3 = 71.1. The next term is m*Ne-m/2 I == 0.7463 X 71.1/2 = 
26.5, and so on. These Poissonian expectations are given in the last column of 
Table 6.2. 

To test the significance of the departure of the observed frequencies from 
the expected frequencies we use the x® test in the usual way, as for a 1 X n table. 
Thus the first cell’s contribution to x® is 
(102^95.3)» __ 

95.3 

and similarly for the other cells. In applying the x* test it is essential to pool 
sufficient cells at the bottom end of the table to satisfy the condition that the 
expected frequency in any cell should not be less than 5. 

A further essential condition in applying the x* test is that we should take as 
the degrees of freedom (n — 2). The reason for losing 2 degrees of freedom as 
compared with 1 as in the usual n X 1 table is that we have not only made the 
totals agree (as in the 1 X n table, this taking up 1 degree of freedom) but also 
used one constant in fitting the shape of the distribution, this latter taking the 
second degree of freedom. 

In the above example we obtain a value of 3.4 for x* with 2 degrees of freedom. 
This corresponds to a probabiilty of 15%, so the fit can be considered satisfactory 
(P would have to be 5% or less before we could consider the hypothesis of agree- 
ment between the observed and expected distribution disproved). 

It will be noted that the shape of the Poisson distribution depends on the 
value of m, the average frequency. If m is less than 1,, as in the previous example, 
then each term must be less than the preceding one, and the first term will be 
the largest. 

If m is greater than 1 but less than 2, then the second term is larger than 
the first, since it is equal to the first multiplied by m and divided by 1. It will 
also he greater than the third, so it is the maximum term of the series. If m 
is greater than 2, but less than 3, then the third term will be larger than the second 
and larger than the fourth. 

Table 6.3 below is derived from the same data as the previous example. The 
interval taken was 3 days, so the average frequency is now 2.2537. 
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Table 63 


Number of 
Incidents in 
periods of 
three days. 

Number of 
Occurrences 
of this type. 

Expectation 

t>f 

Occurrences. 

0 

8 


7.04 


1 

20 


15.86 


2 

13 


17.87 


3 

12 


13.42 


4 

6 


7.56 


5 

5 1 


3.411 


6 

2 

0 O 

1.28 


7 

0 

^ o.U 

0.41 

^5.24 

8 

1 J 


0.11 


9 






It will be observed that the expected frequencies correspond closely to the 
►bserved frequencies, — ^-49 for 4 degrees of freedom, this corresponding to 
L probability of 40%. 

(c) Distribution of Time Intervals 

We have considered the number of incidents per time interval. A slightly 
iifferent way of looking at the matter is to consider the size of interval between 
ach incident. The interval can range from zero up to ten days or more. If 
ve assume that the probability of obtaining an incident is constant throughout, 
ve can predict the relative frequencies of the various sizes of intervals. Let m 
3 e the reciprocal of the average interval, and N the total number of intervals 
observed. Then the number of intervals laying between t, and tg, would be given by 
N (e-mtj — e-^tg) 

vhere e is the base of natural logarithms. 

Table 6.4 gives the distribution of time intervals, derived from the same data 
IS the previous table. 


Table 6.4 


Intervals 

(hours) 

Number 

Observed. 

Number 

Expected. 

0— 7.9 

48 

44.46 

8—15.9 

36 

34.63 

16—23.9 

27 

26.95 

24—31.9 

24 

21.00 

32—39.9 

10 

16.44 

40—47.9 

13 

12.64 

48—55.9 

10 

10.01 

56—63.9 

8 

7.64 

64—71.9 

9 

6.03 

72—79.9 

3 

4.76 

80 to infinity 

13 

16.45 

Total 

201 

201.01 


7 
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The computation for this table is easier if we take 8 hours as a unit of time. 
Using a finer distribution of intervals than is shown in Table 6.4, the average 
interval is 3.9950 units and m its reciprocal was found to be 0.2503. 

The expectation for the first interval is then 

N(e-*niti — e-rntg) 

= 201 ^ ® e ^ 

= 201 (1 — 0.7788) 

= 44.46 

The expectation for the second interval is then 

N(e"mt2 — e-mtg) 

= 201 ^ ^ ^-.0.2608 X 

= 201 (0.7788 — 0.6065) 

= 34.63 

The complete expectations are given in the last column of Table 6.4. 

The closeness of fit can be tested with the x* test. Here a value of 6.13 is 
obtained which with (11 — 2) = 9 degrees of freedom corresponds to a probability 
to 75%. The fit is thus excellent. 

The time distribution of incidents when expressed in the present form is 
rather curious in that the commonest interval is the shortest. 

It is interesting to note that the two methods test rather different aspects of 
the distribution. A large number of long periods (say 4 days or more) would 
show up in the present method as an excessive number in the last category, and 
since the expected number in this category is small it would not require many 
to give a significantly large contribution to x*- The first method described, 
however, would record this in the category of days with zero incidents, and since 
this is frequently a large class the additional number would have to be very large 
to make this cell give a significantly large contribution to x®* Clearly a number 
of days with zero incidents is more remarkable if they occur as a series consecutively 
than if they are interspersed with days with incidents, and the latter treatment is 
more sensitive to this form of departure from the Poisson distribution. On the' 
other hand, the former treatment is more sensitive to the occurrence of a large, 
number of incidents in a short time. 

CHAPTER VII 

THE ANALYSIS OF VARIANCE 

(a) Introduction 

It is a valuable property of variance that if a process has a number of factors 
each making a contribution to the variance of the final product, then this total 
variance is equal to the sum of the component variances (^). 

This statement is less obvious than it may seem. Thus, if we were using 
standard deviations as our measure of variability, it would not be true to say that 
the standard deviation of the final product was equal to the sum of the standard 
deviations produced by the several factors. 

This property of additiveness of variance makes possible the technique known 
as the “Analysis of Variance,’' whereby the total variance of a process can be 
analysed into its component factors, the relative importance of which can then be 
assessed. 

The Analysis of Variance can take many forms according to the structure of 
the process being analysed, and one of the principal difficulties usually found in 
its application is deciding which is the appropriate form. In the present chapter 
only the two simplest forms will be discussed, and in the succeeding chapter a 
more general^ treatment for dealing with the more complex cases frequently met 
in practice will be attempted. 

(^) Assuming the Factors are acting independently. 
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(b) Analysis of Variance Between and Within Batches 
Suppose that we have a two stage process. Stage A takes the raw material 
in batches of 300 lb. at a time and performs operation A on it. This operation 
is not perfectly reproducible and gives rise to a variance a ^ in the final product. 
For stage B of the process, each 300 lb. batch is split into three 100 lb. units and 
operation B performed on these units. Operation B is also not perfectly repro- 
ducible and gives rise to a variance a | in the final product. The problem consists 
of observing the total variance in the final product and analysing it into its two 
components a I and Having done this, we will then be able to decide which 
is the part of the process which requires the more careful control in order to 
reduce the total variability. 

In evaluating g I and o J we proceed by first calculating (n i + ^b) and 
G j-j (in the generi' case where there are n units per batch). The reason for this 
is that it makes the estimation of the significance of g^ practicable. The function 
(n <y i ) has (m — 1) degrees of freedom, where m is the number of batches, 
and has m (n — 1) degrees of freedom. For to exist, therefore, (n gx + a|) 
must be greater than and we can readily test this with the variance ratio test. 

In Table 7.1, below, the value of the property for each unit has been set down 
in columns of 3, the three units in each column coming from a single batch. 

In order to reduce the numbers to small numbers (for convenience in arith- 
metical computation) a constant has been subtracted from all of them : it is 
obvious that this shift in level will in no way affect the relative variability. 


Table 7.1 



—3 

— 4 

—3 

— 1 

4 

—2 

1 

2 


Grand 


—2 

—3 

—1 

3 

3 

3 

0 

1 

— 1 


Total 


—3 

—5 

—4 

2 

6 

1 

1 

1 

2 



Totals 

—8 

—12 

—8 

4 

13 

2 

2 

4 

0 

1 

—2 


The process of computation is as follows: — 

(1) Square the individuals and add, i.e. 

(_3)2 + (~-4)* -f {-—Sy + . . . + P + 22 + 1* = 204 

(2) Obtain the total for each column, square these totals, sum these squares, 
and divide this total by the number of individuals in each column, i.e. 

((-,-8)2 -b (—12)2 _|_ (__8)2 + . . . + 02 -b l2)/3 = 160.67 ^ 

(3) Obtain the grand total for all individuals, square this grand total, and 

divide by the grand total number of individuals, i.e. . 

(—3 —4—3 -fl + 2-1- 1)2/30 = 0.13 

We then form a table of this analysis of variance : — 


Table 7.2 


Source of 
Variance 

Sum of Squares 

Degrees of 
Freedom 

Mean 

Squares 

Components 
of Variance 

Between columns 

(2)— (3)= 160.54 

m — 1 = 9 

17.84 

n gI 4- 

Within columns 

(1)— (2) = 43.33 

mn — m = 20 

2.17 

<4 

Total . . 

(1)— (3)==203.87 

mn — 1 = 29 
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m = number of columns, 
n = number of individuals in each column. 

Variance due to differences between columns. 
al — Variance within columns. 

The three Sums of Squares are the differences between the appropriate 
terms (1), (2) and (3), calculated above. The degrees of freedom are derived 
as follows : — 

(a) Between Columns, There are m columns, hence the degrees of freedom 
for this term is (m — 1) (the degrees of freedom in general equal the 
number of ways the group concerned can be arbitrarily filled in if the 
total is determined, thus when (m — 1) have been filled in the last one is 
uniquely determined, i.e. we can only fill (m — 1) classes arbitrarily.) 

(b) Within Columns. There are n individuals in each column : each column 
therefore contributes (n — 1) degrees of freedom to this term : there are 
m columns, hence the total degrees of freedom is m (n — 1) = mn — m. 

(c) Total. The total degrees of freedom are mn — 1, one less than the total 
number of observations. 

The Mean Squares are the Sums of Squares divided by the degrees of freedom. 

The word ‘‘Squares” in the column headings in the table refers to Squares 
of deviations from means, and so the Mean Squares terms have some of the 
characteristics of variances. 

The last column shows that the Between Column Mean Square estimates 
(n + a|) and the Within Colunm Mean Square estimates a^. Thus if 
exists significantly, then the Between Column Mean Square must be significantly 
greater than the Within Column Mean Square. 

Accordingly, we wish to. test whether the Between Column Mean Square is 
si^ficantly greater than the Within Column Mean Square. This can be done 
with the Fisher variance ratio test, discussed earlier in Chapter IV (a). 

We merely calculate the ratio of the larger mean square to the smaller, here 
17.84/2.17 = 8.22, and enter the table for F with ni = degrees of freedom of 
the^ larger variance (here ni = 9) and n 2 = degrees of freedom of the smaller 
variance (here n^ = 20). If our value for F is greater than that given in the table, 
then the result is more significant than the level of the table. Here for example, 
for ni = 8 (the table does not give = 9), ng = 20, F has the values 2.45, 3,56 
and 5.44 for the 5%, 1% and 0.1% levels of significance. Accordingly this 
particular result is more significant than 0.1%. 

Having thus established that the Between Columns Mean Square is signi- 
ficantly greater than the Within Column Mean Square, i.e. that the Between 
Column Variance does exist, we can proceed to calculate its value. We have 
n ai + = 17.84 

4 =.2.17 

where n = number of individuals in each column, here 3. 

ThusV^= (17.84 -~2.17)/3 
= 5.22 
a| = 2.17 

The total variance observed in our original data is thus made up of these 
two components = 5.22 due to differences between columns (here columns = 
batches) and (j| = 2.17 due to variability within columns (here within batches, 
that is to say, between units for a given batch). 

(c) The Investigation of Multi-Stage Processes 

We can thus see that the larger proportion of the total variability in the above 
process is arising at stage A, and only a minor part at stage B. Thus in any 
attempt to standardise the process, to reduce the total variability in the final 
product, almost all the attention should be directed towards stage A. 
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The total variance as the process is at present is ci = <jI + ag = 5.22 + 
2.17 = 7.39, i.e. its standard deviation is V7.39. Our estimate of the spread 
within which 95% of all individuals should lie is thus i 1.96 X V7.39 = ± 5.3, 
or a total spread of 2 X 5.3 = 10.6. From inspection of the original data this 
is seen to be reasonable : out of the thirty observations the smallest is — 5 and 
the largest is +6, a spread of 11.0. 

Suppose we directed our efforts at standardising stage B, and were com- 
pletely successful, and reduced a| to zero. The total variance would now be 
<j| — 5.22, i.e. the standard deviation would be 2.29 and the 95% spread 
di 1.96 X 2.29 = ± 4.5, or the total spread of 9.0. The total spread originally 
was 10.6, so the improvement has been comparatively slight. 

If, on the other hand, we had directed our efforts at standardising stage A 
and were completely successful in reducing al to zero, the total variance would 
now be cf| ~ 2.17 or the standard deviation 1.47. The 95% spread would then 
be d=1.96 X 1.47 == ±2.9 and the total spread 5.8. 

The improvement effected by standardising stage A is thus very much more 
substantial than that achieved by standardising stage B. It is clear that effort 
spent in investigating stage B would have been largely wasted. Before carrying 
out a detailed physico-chemical investigation on a multi-stage process, it is clearly 
desirable to find out with such an analysis of variance which are the stages giving 
rise to the variability, and concentrate upon them. The advantage of this pro- 
cedure is that the stages subsequently ignored may have contained a large number 
of variables, each of which would have required investigation. In carrying out 
these investigations, we could not be confident that we were including the signi- 
ficant variable, and the failure to obtain a significant correlation might be due to 
the investigation of the wrong variables rather than to the fact that there was no 
significant variable for this part of the process. 

The analysis of variance proceeds by concentrating initially upon the structure 
of the process, and only subsequently investigating individual variables. 

(d) Analysis of Variance of Columns of Unequal Size 

It will have been realized that the analysis of variance between and within 
columns is in effect an extension of the Student t test : instead of comparing 
two means we are comparing m means, where m is the number of columns. It 
might be thought that we might make a series of t tests for the column means 
taken two at a time. This would involve a lot of labour, and in addition involves 
us in certain complications. Thus if we had 20 comparisons, and we are working 
to the 5% level of significance, then on the average even if tx. columns were 
drawn from the same population we would expect one of them to* give us a value 
of Student’s t as large as the 5% value. The use of the analysis of variance 
evades these difficulties. 

It sometimes happens, however, that the number of individuals in each 
column are not the same, when the treatment given in Section (b) will not be 
directly applicable. 

Consider the data in the table below. It represents the throughputs obtained 
from units of plant before failure through corrosion. The units are categorised 
according to the foundry manufacturing them. 


Foundry 

Throughput obtained 

Total 

Mean 

A 

84, 60, 40, 47, 34 

265 

53.0 

B 



78.3 

C 

46,93,100 

239 

79.7 
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Prima facie it might be concluded that the pots from Foundry A are giving 
lower throughputs on the average than those from Foundries B and C. 

The analysis proceeds by obtaining the following terms : — 

( 1 ) Square the individuals and add, i.e. 

842 4 . 602 + 402 + 4 - 932 ^ 1002 == 95,709 

(2) Square each column total, divide by the number of individuals in that 
column, and sum these terms, i.e. 

265 V5 + 7052/9 + 2392/3 = 88,310 

(3) Square the grand total and divide by the total number of individuals, i.e. 
(265 + 705 + 239)2/17 = 85,981 

The analysis of variance is then as below : 


Source of Variance 

Sums of Squares 

Degrees of 
Freedom 

Mean 

Squares 

Between Foundries 

(2)— (3) = 2329 

2 

1164 

Within Foundries 

(1)— (2) = 7399 

14 

528.5 

Total 

(1)_(3) = 9728 

16 



The degrees of freedom are derived as follows : — 

(a) Between Foundries : one less than the number of foundries. 

(b) Within Foundries : each foundry contributes one less than the number 
of observations on that foundry, viz., 4+8 + 2= 14. 

(c) Total : one less than the total number of observations. 

Testing the Between Foundries Mean Square in the above analysis, it gives 
a variance ratio of 1164/528.5 = 2.20 for degrees of freedom ni = 2, ng = 14. 
This is considerably less significant than the 5% level of* significance, so there is 
insufficient justification for regarding the Foundry A pots as giving smaller average 
throughputs than those from the other foundries. 

In the present instance we are more interested in the actual averages for the 
three foundries, and are using the analysis of variance to test the significance of 
the apparent difference between the averages. In the more general case, of the 
analysis between and within batches, as discussed in the two previous sections, 
we probably wish to calculate the two components of variance. The Within 
Fotmdries Mean Square as before estimates the Within Foundries variance, i.e. 

CT^== 528.5. The calculation of the Between Foundries variance is more com- 
plicated when the batch (column) sizes are unequal. 

If N in the total number of observations, and m is the number in each batch 
for i = 1 to k, there being k batches, then the Between Foundries (Between Batches 
or Columns) Mean Square estimates 


N2-Sni2 

N(k-l) 

It will be noted that in the case where all the batches (columns) are of equal size, 
i.e. all the ms are equal to n this reduces to 
(n k )2 k n 2 2,2 
nk(k— 1 ) + 


2 I 2 

— n ajj + 

as quoted in Chapter VIII fb). 
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or = 124.3 

It might be re-emphasised that the calculation of the component of variance 
had been given here only to illustrate the method of calculation. In practice 
with this particular pioblem it would not be done because firstly we would be more 
interested in the averages, secondly the effect is so non-significant that its calcula- 
tion would be meaningless, the component being indistinguishable on the present 
data from zero. It further would not be desirable practice to estimate the variance 
betwe m batches (column) from only three batches : a variance so estimated would 
have a very large error. Finally, this is a case where it might be held that it 
was unreasonable to suppose that there was an infinite population of foundries 
from which these three are a sample : this point is commented on further in 
Chapter XII (b). 


(e) Analysis of Variance into Components due to Rows, Columns, 
and a Residual 

It frequently happens that not only can the individuals (units) be categorised 
according to batches (columns) but also simultaneously according to rows. Thus 
in the previous example, the first individual in every column (batch) was processed 
on a particular machine, all the second individuals in every column on another 
machine, and all the third individuals on a third machine. Each row thus corres- 
ponds to one particular machine. There may be a systematic difference between 
these machines ; we wish to test whether such an effect exists, and if it does exist 
estimate its contribution to the total variance of the final product. 

Formally what we have to deal with here is a twofold classification; every 
individual belongs to a particular row and to a particular column. 

Other examples of twofold categorisation are the output from g. shop accord- 
ing to machine and according to the shift operating the machines.^ Or, in 2 p. 
experiment on conditions of nitration of cellulose, we could have ni different acid 
concentrations, corresponding to rows, and n 2 different acid temperatures, corres- 
ponding to columns, where and no could be any convenient numbers. 

To revert to our example of batches split into three units, we proceed by 
calculating the following terms : — 

(1) Square the individuals and add (we have done this already — ^term (1) of 
the previous analysis). 

(2) Obtain the total for each row, square these totals, sum these squ^es, 
and divide this total by the number of individuals in each row (this is 
the new term, here equal to 8.40). 

(3) Obtain the total for each column, square these totals, sum these squares,’ 
and divide this total by the number of individuals in each column (term 
(2) of the previous analysis). 

(4) Obtain the grand total for all individuals, square this grand total and 
divide by the grand total number of individuals (term (3) of the previous 
analysis). 
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We then form a table of this analysis of variance. 


Source of 
Variance 

Sum of Squares 

Degrees of 
Freedom 

Mean 

Square 

Components 
of Variance 

Between 

(2)-(4) = 

ni — 1 = 2 

4.13 

n, 0? + <^ 

Rows 

8.40 — 0.13 




Between 

(3)-(4) = 

n, — 1 = 9 

17.84 

Hi <J| + a? 

Columns 

160.67 — 0.13 




Residual 

(l) + (4) 

(n,-l)(n,-l)=18 

1.94 

<Jo 


-(2) -(3) 




Total 

(l)-(4) = 

Ui n 2 — 1 = 29 




203.87 





nj = number of rows = 3 


n 2 = number of columns =10 
©2 variance due to differences between rows 
o® -- variance due to differences between columns 
= residual variance 

The Residual Sum of Squares can be obtained as that combination of terms 
indicated, but in practice it is easier to calculate the Row and Column Sums of 
Squares, and find the Residual as the difference between their sum and the Total 
Sum of Squares. 

To test for the significance of the Between Rows Mean Square we compare 
it with the Residual using the variance ratio test. Here the variance ratio is 
4.13/1.94 = 2.12 for degrees of freedom nj = 2, ng = 18. Referring to the 
tables in the Appendix we see that this is not significant. Thus our hypothesis, 
that there was no systematic difference between the rows (machines), is in reason- 
able accord with this data. If the Between Rows Mean Square had proved 
significant, we could have proceeded to estimate the magnitude of a?, the variance 
due to differences between rows (machines) from the pair of equations 
n* of + 05 = 4.13, 05 = 1.94, whence of = (4.13 — 1 . 94 )/n 2 , 

CHAPTER VIII 

THE QUALITY CONTROL CHART 

(a) Introduction 

The Quality Control Chart has been much used in recent years as a means 
of control of quality of production, particularly in light engineering processes such 
as the use of automatic lathes, presses, etc. It has not been used much hitherto 
in the chemical industry though certain types of process seem suitable for its 
application. Here it will only be discussed with reference to investigational 
work.(^) 

{^) The literature of industrial quality control is now very great. The standard work 
“Economic Control of Quality of Manufactured Product^' (Macmillan : 
1931). Short accounts are British Standard 600R “Quality Control Charts,” B. P. Dudding 
h 1942 ; British Standard 1008 “QuaHty Control” : “A First Guide to 

Quality Control for Engineers,” E. H. Sealy, H.M.S.O., 1946. A fuller account is L. E. 
Simon s An Engineer’s Manual of Statistical Methods” (Chapman & Hall, 1941). 

A discussion of quality control applied to routine chemical analysis is by H. E. MacColl : 
Chemistry and Industry, December 9th, 1944, page 418. 
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The quality control chart is designed to detect the presence of ‘‘assignable 
causes of variation” in a sequence of individuals. In effect it takes “rational 
sub-groups” (that is, a group of individuals -more likely to be similar than the 
general population, e.g. individuals from a particular machine, operator, day, 
consignment of raw material, etc.), obtains from these rational sub-groups a measure 
of the population variance, and then checks that the sub-group means do not 
vary more than they should do on the assumption that all individuals are drawn 
from the same population, i.e. that there is no significant difference between the 
sub-group means. It will be apparent that the result of this test is the same as 
the analysis of variance for between and within batches. 

The quality control chart as used in practice almost invariably derives its 
estimate of variance within the sub-groups not from the variance but from the 
mean raSige within the sub-groups, there being available factors for converting 
mean range to standard deviation (see Section (e)). Having obtained an estimate 
of or (the within sub-group standard deviation) we can assert that the means of 
samples of n will be distributed with standard deviation a/Vn. Thus 95% will 
be within 1.96<i/\/n on either side of X (the grand mean, assumed equal to the 
population mean. This is the same as 1 in 40 below X — 1.96<j/Vn and 1 in 40 
above X + 1.96a/Vn. The quality control chart normally uses these limits as 
“iimer” limits and also “outer” limits at 3.0 % these giving 99.^% within 
the limits, or 1 in 1000 below the lower and 1 in 1000 above the upper limit. 

If the mean of a particular sample falls outside the limits, it is probable that 
the underlying assumptions are not being satisfied. These assumptions are, of 
course, that the sample is drawn from a population of standard deviation a and 
grand mean X. The former point is checked by a control chart on the sub-group 
ranges (see next section), and therefore it is probable that the sample’s mean does 
differ significantly from the value X. 

(b) Within Batch Variability; the Control Chart for Range 

Consider the data used previously in Table 7.1, Chapter VII (b). We wish 
now to test the hypothesis that the variabilities within each column (batch) are 
drawn from a homogeneous population: in other words, that batches do not 
differ in their internd variability. 


Table 8.1 


Data 

—3 

—2 

—3 

—4 

—3 

—5 

—3 

—1 

—4 

—1 

3 

2 

4 

3 

6 

—2 

3 

1 

1 

0 

1 

. 2 

1 

1 

—1 

—1 

2 

—1 

1 

1 

X 

-2.67 -4.00- 

-2.67 

1.33 

4.33 

0.67 

0.67 

1.33 

0 

0.33 

w ! 

1 

2 

3 

4 

3 

5 

1 

1 

3 

2 

S(x) 

—8 

—12 

—8 

4 

13 

2 

2 

4 

0 

1 

S(x*) 

22 

50 

26 

14 

61 

14 

2 

6 

6 

3 

S(x— x)» 

0.67 

2.00 

4.67 

8.67 

4.67 

12.67 

0.67 

0.67 

6.00 

2.67 

2 

0.33 

1.00 

2.33 

4.33 

2.33 

6.33 

0.33 

0.33 

3.00 

1.33 


For this purpose we obtain the range w for each batch and plot these con- 
secutively on a chart as in figure 6. We then calculate the mean range w, here 
equal to 2.50, and plot on the chart an inner control line at D' o .»75 w = 2.17 
X 2.50 = 5.4 and an outer control line at D' o » 9 « w = 2.98 X 2.50 = 7.5. 
These D factors are obtained from Table V in the Appendix for sample size 3. 
If the batches are similar in their internal variability, then on the average not 
more than 1 out of 40 points will lie outside the iimer control line, and only 1 
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out of 1000 points outside the outer control line. The occurrence of two points 
outside the inner control line within two or three points, or of a single point 
outside the outer control line, therefore, is strong evidence that we cannot regard 
the batches as having the same internal variability. 
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Figure 6 . 


In the present instance it is clear from figure 6 that there are not points 
outside either of the control lines, so there is no reason to suppose that the batches 
differ in their internal variability. 


(c) The CJontrol Chart for Ranges compared with Bartlett’s Test 

To digress from the present theme, it will be noted that the control chart 
for range is testing exactly the same hypothesis as Bartlett’s test (see Chapter IV (c)). 
To apply the latter, we first calculate the variance of each batch ai* through the 
steps indicated, namely by obtaining S(x), S(x*), and thus S(x — X)*. The 
sum of the natural logarithms of the within batch variances oi* is r(log« 

= 3.06039. The degrees of freedom n of the individual variances are 2. The 
mean of the batch variances, S* is 2,164, and k, the number of variances being 
compared, is 10 . 


Whence we have 
B = k n log, 


C = 


S* — n S loge (cTi*) 

10 X 2 X loge 2.164 — 2 X 3.06039 = 9.31862 

1 + ^ ^ — 1 4- ^ _ 1 lei 

^ + k“^ + 3x2xl0“ 


Thus B/C = 9.319/1.183 = 7.87 and is distributed as 7 / with (k — 1) = 9 
degrees of freedom. This is much less significant than the 10% level of signi- 
ficance, so there is no reason to suppose that the batches differ from each other 
in their internal variability. 

The use of Bartlett’s Test thus leads to the same conclusion as the control 
chart for ranges. It requires more calculation, but on the other hand has the 
advantages of being more sensitive and precise. 
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(d) Between Batch Variability : the Control Chart for Means 

We wish to test whether the batches differ significantly in their means, having 
regard to the observed amount of variability of the individuals. 

We obtain the mean for each batch (first row in lower part of Table 8.1) and plot 
this consecutively as in Figure 7. We calculate the grand mean X as — 0.67, and 
draw in inner control lines at X ± A' 0-025 w = — 0.67 ± 0.668 X 2.50 or 1,60 and 
— 1.74 and outer control lines at X d: A'o-ooi w = — 0.67 di 1.054 X 2.5 or 2.57 
and — 2.70. Thus, if the Between Batch variability is no greater than that due 
to the Within Batch variability, 1 point out of 20 will lie outside the inner control 
lines and 2 out of 1000 outside the outer control lines. Here 4 out of 10 lie out^de 
the inner control lines and 2 out of 10 outside the outer control lines. It is thus 
evident that the hypothesis of no between batch variability is untenable. 
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Figure 7. 

This is in complete agreement with the conclusion reached using the analysis 
of variance in Chapter VII (b). With the control chart the arithmetical labour 
is much less, but ^e analysis of variance has the advantage of giving numerical 
estimates of the relative importance of the Between Batch and Within Batch 
variabilities, and of the exact significance of the existence of the Between Batch 
variability. For detailed and exact work, therefore, the analy^ of variance is to 
be preferred, but for preliminary and exploratory investigations tbe control chart 
is very useful. 

An important proviso in the use of control charts is that the control lines 
should be based upon at least 10 and, preferably, 20 batches : otherwise the 
estimate of the mean range w is not very accurate. 

(e) The Conversion of Range to Standard Deviation 

It frequently happens that we have a series of samples of n individuab drawn 
from a population and we wish to know their standard deviation. We can of 
course calculate the variance for each sample and take the average (since the 
samples are all of n individuals the simple average variance will be the same as 
that obtained by pooling the sums of squares and the degrees of freedom). The 
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arithmetic required is much less, however, if we obtain the range for each sample, 
calculate the average range, and divide this average range by the factor dn given 
in the last coliimn of Table V in the Appendix, selecting that dn corresponding 
to the actual size of sample. The resulting figure is the standard deviation. 
The standard deviation so calculated is an approximation and can only be used 
when the number of samples taken is reasonably large, say not less than 20. The 
estimate is then assumed to have nearly infinite degrees of freedom. 

It will be seen that except when the sample size is 2 we are sacrificing some 
accuracy in our estimate of <y, the compensating gain being the ease of computation, 
of course. This loss of information is not very serious, however, being less than 
20% for samples of size 10 and less for smaller samples. Under the conditions 
in which we use this method this does not really matter, as since we have such a 
large sample our estimate will be accurate enough for almost all purposes. 

The reader will note that it is this factor dn which is the basis of the factor 
A'o.025 and A'o.ooi used for the formulation of the control lines for means. Thus 
it is clear, that for a sample of size n, control lines for 1 in 40 limits (1 in 40 about 
the upper limit and 1 in 40 below the lower limit, i.e. 1 in 20 outside the limits) 
should be at 1.96<7/Vu, where a is the standard deviation, 1.96 is the value of 
t for infinite degrees of freedom (it is assumed that a is known exactly as we have 
supposedly taken a large number, preferably greater than 20, of samples) and the 
5% probability level, and n is the size of sample. If for example n is 4, and w is 
the mean range, then the limits should be at 1.96(w/2.059)/V4 = 0.476 w, where 
2.059 is the value of dn for n = 4. It will be seen that this is the value for A'o.ois 
given in Table V of the Appendix. 


CHAPTER IX 


THE RELATION BETWEEN TWO VARIABLES 


(a) Introduction (^) 

A problem that arises frequently is to determine whether an apparent relation 
between two variables is significant, and having shown it to be significant, to 
determine the best form of representation. 

The statistical methods available for dealing with this problem are not very 
satisfactory. The treatment in general is to test whether the data can be repre- 
sented by the equation for the simple straight line, y == a + bx, whether the 
deviations from tins straight line are significant and if so can these deviations be 
represented by the equation y == a + bx + cx* or y = a + bx + cx* + dx®, etc. 
In practice, the computational labour is excessively heavy for any of the steps 
but the first, namely, seeing whether the data can be approximately represented 
by a straight line of the form y = a + bx.(^) 

It is dear where the unsatisfactory nature of this treatment lies : there is 
no a priori reason why a complex system should give a straight line relationship 
between two of its variables, or even the more complex relationship y = a + bx 
+ cx®, etc. The case where the treatment is most appropriate is where the data 
when plotted on a graph has the appearance of an elliptical cloud, and we are 
dubious whether this ellipticity is genuinely due to a real relationship or an accident 
of sampling. Alternatively there might be little doubt about the genuineness of 


(9 A comprehensive survey of the field is given by W. E. Deming : “Statistical Adjust- 
ment of Data’’ (John Wiley : Chapman & Hall), 1943. 

r?- J where the independent variable is changing in equal stages: c.f. 

for Research Workers,” Sections 27 and 28. Also, G. Egloff 
R. C. Kuder, Journal of Physical Chemistry : Vol. 46, page 926, 1942 : and L. H. C. 
Tippett : Methods of Statistics” (William & Norgate), Section 9. 3. 
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the existence of a relationship but we want to have the best estimate of the 
relationship. 

If the data when plotted on a graph fall on a smooth curve of complex shape 
then the present techniques are not applicable. In practice, therefore, we shoiUd 
only use the present statistical treatment when the pairs of observations on the two 
variables plotted on a graph give something that looks approximately a straight line, 

(b) Transformations 

We may have a priori reasons, from the theory of the process, to suspect 
that the relationship should have the form 

Y = AB* 

which is equivalent to the expotential form 

Y = AeC* 

We can transform this to a linear function by taking logarithms, 
log Y = log A + X log B 

or putting log Y = y, log A = a, log B = b, we now have the relation in the 
convenient linear form. 

y = a + bx. 

Also, if we suspect that relationship should have the form : — 

Y = AX® 

where B can have any value, e.g. 2, when the relation would be 

Y = ax’ 

we can take logarithms 

Log Y == log A + B log X 

or putting log Y = y, log A = a, log X = x, we again have the relation in the 
convenient linear form 

y = a + Bx 

(c) The Correlation Coefficient 

To test for the significance of an apparently linear relation we calculate the 
correlation coefficient r defined by (S denotes summation over all pairs of 
observations) 

S(x — x)(y — y) 

V(S(x-x)‘ S(y-y)>) 

It has the characteristics such that if the relationship between the data can 
be represented exactly, by a straight line then r = ± 1, positive if the straight 
line has a positive slope and negative if the line has a negative slope ; if on the 
other hand there is no relation at all between the variables then r = 0. Even 
when the variables may have no relationship -we will usually get a non-zero value 
for r, since the accidents of sampling will often lead to an apparent trend one way 
or the other. We can, however, test whether an observed value for r is larger 
than would have been obtained accidentally in the absence of a correlation. We 
enter the table for r with degrees of freedom two less than the number of observa- 
tions (the reason for it being two less than the number of pairs of observadans h 
that we have in effect fitted two constants, a and b, to the data). If our value 
for r exceeds that given in the table for say, the 5% level of significance, then 
we may regard it as being 19 to 1 that there really is a correlation. 

We will illustrate the use of the statistical treatment of data suspected of being 
expressed in the form y = a + bx with the data partially (there are actually 
30 pairs of observations) given by 


y 

1.66 

2.0 

1.66 

1.78 

1.89 

1.82 . . . 

. . . 2.07 

2.29 

X 

6 

8 

17 

16 

19 

16 . . . 

. . .13 

12 
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In the table, y represents stick weight of cordite made from niteo-cellulose 
TOth acetone viscosities x. The variables have been transformed, however, to 
make the arithmetic Ughter ; y actually represents stick weight m ounces mmus 
13 lbs. 1 oz., and x actually viscosity in centipoises mmus 50. On plotting y 
against x one obtains a graph with a wide scatter but, apparently, a decided 
tendency for higher viscosities to give lower stick weights. 

We obtain the following terms (n = number of pairs of observations). 


S(x — x)*= S(x*)- 

= ( 6 * + 8 ‘ + . . • + 12 *) 
= 3213 

2(y — y)*= S(y*) — 

= ( 1 . 66 * + 2 . 00 * + . . 

= 9.060 

i:(x — X)(y — y) = S(xy) 


(6 + 8 + • • • + 12 )* 

30 


+ 2.29*) — 
SW S(y) 


(1.66 + 2.00 + . . . + 2.29)* 
30 


= (1.66 X 6 + 2.00 X 8 + . . .) — 
= -47.84 


( 1.66 + 2.0 + • • •) (6 + ® + • ••) 
30 


We then calculate the correlation coefficient r, defined as 
S(x — x)(y — y) 

- VS(x-x)* S(^-y)* 


—47.84 

V 9.060 X 3213 


= —0.280 


We see from the table for r (Table IV in the Appendix) that the probability 
of getting such a value for r in the absence of any correlation is about 0.10, that 
is to say 1 in 10 times we could get as large a value for r as we did here even in 
the absence of a correlation. The evidence for the correlation is thus madequate. 


(d) The Equation for the Regression Line 

If r proved to be significantly greater than zero we would take the matter a 
little further : although in the example considered here r was not significantly 
greater than zero we will use its data to illustrate the point. 

If we wished to use the data to predict values of y from known values of x 
•,ve would calculate 

a = = y = 49.58/30 = 1.65 


b = 

X == 


S(y — y) (x — x) _ —47.84 
S(x — x)* ■ 3213 

Sx 670 , 


—0.0149 


The formula for deducing the most probable value of y corresponding to a 
aiven value of x (technically known as the regression of y upon x) is then 
y = a4-b(x — x) 

= 1.65 — 0.0149 (x — 22.3) 

= 1.98 — 0.0149 X 
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This regression line of y upon x is the line which gives the squares of the 
deviation of every point, measured parallel to the y-axis in units of y, summed 
for all points, as a minimum. It gives the best estimate of y from a known value 
of X. 

If we wished to predict the value of x from a known value of y we would 
use the similar but different equation derived by calculating 

a'=^) 

n 

S(y — y) (x — x) 
s(y — y)* 

y 

^ n 

leading to the equation 

X = a' + b' (y — y) 

This line gives the deviations of every point, measured parallel to the x-axis in 
xmits of X, summed for all points, as a minimum. It gives the best estimate of 
X from a known value of y. 

Non-statisticians frequently object to the concept of two “best” lines, two 
regression lines. They hold that there must be one line which represents the 
“true” relation. That may be so, but we are not concerned with “true” relations : 
we are concerned with making the'best possible estimate of y from a known value 
of X, using a certain amount of experimental data to estimate the relationship. 
In one case we assume that we know x and wish to estimate y, in the other case 
we assume that we know y and wish to estimate x. These are two different 
operations, so it is not surprising that we use two different functions,- 


(e) The Residual Variance about the Regression Line 

We assume that we now have our regression line of y upon x with the points 
scattered about on either side of it. The standard deviation or of this scatter, 
measured in units of y p arall el to the y -axis, is given by 

= y'l-0.280«x = 

This figure affords a good measure of the usefulness of the equation for the 
regression line for predicting values of y for given values of x. 

Thus suppose we wish to draw 95% confidence limits on either side of the 
regression line, within which 95% of all points should lie, we look up the value 
of t for N — 2 = 28 degrees of freedom and the 5% level of significance, and 
calculate t ar = 2.05 X 0.546 = 1.12. We then draw in two lines parallel to 
the regression line, but one displaced 1.12 units of y downwards and one displaced 
1.12 units of y upwards. Thus if we use the regression line to predict the value 
of y from a known value of x, these confidence limits give us the limits between 
which we have a 95% chance of being correct in our prediction. (®) 


(f) The Use of the Analysis of Variance for Examining Regression 

An alternative method of testing the significance of a regression is by analysing 
the variance of the dependent variable y into its component factors. 

The total sum of squares of y is S(y — y)* = 9.060. The sum of squares 
accountable for by the regression line is 

[S(y-y)(x-x)]» _ 4^* _ 

S(x — x)» ~ 3213 

(*) This statement is approximate : the exact treatment follows in Section (h) 
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We then enter these into a table of analysis of variance as below : 


Source of 

Sums of 

Degrees of 

Mean 

Variance 

Squares 

Freedom 

Squares 

Regression 

0.7123 

1 

0.7123 

Residual . . 

8.3477 

28 

0.2981 

Total 

9.060 

29 



The degrees of freedom are 1 for the regression line, one less than the total number 
of pairs of observations for the Total, and the residual is the difference between 
these two. Similarly the residual sum of squares is obtained as the difference 
between the Total and the Regression. 

The significance of the Regression Line is estimated by comparing its mean 
square with the residual in the usual way. Here it lies between the 20% and 
5% levels. 

The residual is the same as the residual variance about the regression line 
that we calculated ea rlier in the previous section ; thus the square root of this 
residual variance is VO.2981 = 0.546 as obtaiiled earlier. 

It will be noted that the results obtained by the present approach are identical 
with those found by calculating the correlation coefficient. 

(g) Comparison of Regression Coefficients 

We might wish to test whether two regression coefficients were significantly 
different. Thus we might be conducting experiments upon some system, (a) 
with clockwise stirring and (b) with anti-clockwise stirring. We might plot 
graphs for the two series and suspect that they were different. 

We can make this test by using the fact that the variance of a regression 
coefficient is given by 

Residual Variance of y 
2(x — x)* 

i.e. in the present instance 0.2981/3213. The standard deviation is the square 
root of this, namely 0.0096, with two less than the total number of observations, 
namely 28, as its degrees of freedom. 

If the two regression coefficients b^ and b, have variances ai* and ctj*, then 
we calculate Student’s t as 
^ bx~b, 

\/a,» + a** 

and the degrees of freedom for this t are the sum of those of the two individual 
variances. 

(h) Exact Formula for the Residual Variance about the Regression 

Line 

As stated earlier, the formula given in Section (e) for the residual variance 
about the regression line is not quite exact. If is the residual variance as 
already defined, and ob® is the variance of the regression coefficient itself, calculated 
as in Section (g), then the exact formula (^) for a is 

a/ = (<Tr2 + XW) 

where X is the distance in units of x from the midpoint of x, x. For example, 

Journal of the American Statistical Association, Vol. XXV, page 139,. 
lyjO, gives a. full treatment of this and the more complicated cases of parabolas, etc. 
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in the case we have been considering, x is quoted in section (d) as 'ZL:6. In 
section (e) S(x — x)® is quoted as 3213, whence == 110.79. We have ax of 
the order of 10, and thus to select a point towards one end of the x scale we can 
take x + 2ax = 22.3 + 2 X 10 = 42.3 : this corresponds fo X = 20. Inserting 
the appropriate values for ar and cb we get 

= (0.2981 + 20^ X 0.00009278) 

= 0.3352 
whence a^i = 0.5790 

In the present instance, it is clear that the use of the approximate formula 
would have given results close to the truth, and of course the nearer we are to 
the midpoint of x the smaller is X and the less important the correction. 

(i) The Use of the Analysis of Variance for Checking Linearity 

A problem arising frequently is to determine whether a set of observations 
which to the first approximation can be represented by a straight line can be 
said to depart significantly from the straight line. This question can be readily 
settled if the data is in the symmetrical form suitable for the application of the 
analysis of variance. 

Consider the data below : — 


X 

10 

20 

30 

40 

y 

92.8 , 
93.0 

94.0 

94.3 

,95.1-""’^'' 

*""94.8 

94.9 

94.8 

y 

92.9 

94.15 

94.95 

94.85 


We can regard x as representing the temperature in of a certain part of 
a process and the two values of y repeat determinations on the yield at that tem- 
perature. We are interested in the effect of temperature upon the yield. To the 
first approximation the yield y appears to increase linearly with the temperature 
X, but the value for x = 40 is less than that for x = 30. We shall test whether 
this apparent departure from linearity is significant. 

We first transform the variables to give us smaller, and hence easier, numbers 
to work with.. Thus we can use X = x/10, and Y = (y — 92) X 10. The data 
then becomes : 


X 

1 

2 


4 

Y 

8 

20 

31 

29 


10 

23 

28 

28 


We can in the first instance analyse the variance of the data into between columns 
(given values of X) and within columns. 

Proceeding on these lines, we calculate the terms 

(1) = 8» + 10* + . . . + 28* = 4463 

(2) = [(8 + 10)* + (20 + 23)* + (31 + 28)* + (29 + 28)»]/2 = 445/5 

(3) = (8 + 10 + 20 + . . . + 28)*/8 = 3916.1 
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We enter these terms into a table as below : — 


Source of Variance 

Sums of 
Squares 

Degrees of 
Freedom 

Mean 

Squares 

Between Temperatures 

(2)-(3) = 
535.4 

3 

178.5 

Within Temperatures 

(l)-(2) = 
11.5 

4 

2.87 

Total . . 

(l)-(3) = 

546.9 

7 



Testing the Between Temperature Mean Square against the Within Tem- 
perature Mean Square, we see that it is significant. 

We now divide the Between Temperature Sum of Squares into that due to 
the linear regression line and that due to departure from it. 

We calculate 

£(X) =2(l + 2 + 3 + 4) = 2xl0 = 20 

2:(X») = 2 (1* + 2" + 32 + 4*) = 2 X 30 = 60 

2(X — 60 — 20V(4 X 2) = 10 

5:(XY) = U8 + (20 + 23) + 3 (31 + 28) + 4 (29 + 28) == 509 

£(¥ — ?) (X — X)) = HiXYJ— SQC) S(Y)/N 
= 509 — 20 X 177/8 = 66:5 

The factor 2 in the expressions for S(X) and S(X®) arises through there being 
2 observations on Y for each level of X. 


The sum of squares attributable to the regression line is then 
[S(Y-?)PL-X)]» _ 6^* _ 

s(x — X)» “ 10 


Since the total sum of squares attributable to differences between temperatures 
is 535.4, the sum of squares' due to deviations from the regression line is 
535.4 — 442.2 = 93.2 

The degrees of freedom for the regression line are 1 and for the deviations from 
it the remainder of those belonging to differences between temperatures. 

We enter up these expressions in a table as below : 


Source of Variance 

Sums of 
Squares 

Degrees of 
Freedom 

Mean 

Squares 

Between Temperatures : 




Regression 

442.2 

1 

442.2 

Deviation from Regression 

93.2 

2 

46.6 

Within Temperatures 




(Residual) 

11.5 

4 

2.87 

Total 

546.9 

7 
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The general significance of the regression line can be tested by comparing the 
Regression Mean Square with the Residual. Here it is clearly highly significant. 
To test whether the departure from the straight line is significant, we calculate 
the variance ratio for the Deviation from Regression as 46.6/2.87 == 16.2 for 
degrees of freedom = 2, n 2 = 4. This is nearly as significant as the 1% level, 
so we may take it with that degree of confidence that the data cannot be com- 
pletely represented by a straight line. 


The best estimate of the regression coefficient of Y upon X is given by 


S(X— X)* 


10 


(j) The Calculation of Correlation Coefficient, etc-, from Grouped 
Data 


If the number of pairs of observations is large, say about 100, the labour in 
calculating the correlation coefficient as in Section X (c) becomes considerable, 
and can be lightened by grouping the data on the same general lines as in Chapter 
II (j). 

We divide each axis into about 10 or 20 equal-sized groups, and coimt up 
the number of observations occurring in each cell. 

The data in Table 9.1 refers to a manufacturing -process involving the form- 
ation of an emulsion in water. The independent variable x is a function of the 
quality of the water (hardness) and the dependent variable y is a quality of the 
^al product. Table 9.1 is a graph with Ax representing the x-axis and Ay the 
y-axis. For each set of values of Ax and Ay, i.e. for each point on the graph, 
the number gives the number of observations occurring with those values (when 
none occur then the space is left blank). The column headed B gives the frequency 
with which the corresponding values have occurred. Thus to take Ax = 10, it 
will be seen that there are points with Ax = 10 at Ay == 23, 17, and 16, so the 
corresponding entry for Bx is 3. C is the arbitrary scale with shifted origin used 
to lighten the arithmetic. D and E are the terms B X C and B X respectively. 
The total number of observations is given by the sum of the frequencies in Column 
B, here 101. 


Thus we have 
2:(x — x)2 


S(y - y)2 


(Sx)» 

(Sy)* 


S(x») 

915- 

2(y)* 

875 — = 856.7 


The calculation of the term S(x — x) (y — y) is rather more difficult. For 
each level of Cy, we multiply the frequency for each cell by its value for Cx, and 
sum for all cells at this level of Cy. This is the column headed F. Thus for 
Cy = — 3, we have three cells, the first with unit frequency at Cx = — 3, the 
second with frequency 2 at Cx = — 2, the third with unit frequency at Cx = 1. 
Thus F = 1 X (—3) + 2 X (—2) +1x1 = —6, As a check upon the accur- 
acy of column F its sum should equal 2x. Column G is then obtained as the 
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product of Cy and F. Its sum is S(xy). To correct this into terms of deviations 
from the means rather than the arbitrary zeros we subtract the correcting factor 
Sx Sy _ (-25) X (+ 43) _ ,, 

N 101 

We thus obtain 

S(x — x) (y — y) = 399.0 — ( — 10.6) 

= 409.6 


The correlation coefficient is then 

S(x — x) (y — y) 

^ “ Vi:(x — x)’“ i;(y — y)^ 

409.6 

* “ V'(908.8 X 856.7) 

= 0.466 

which for 99 degrees of freedom is much more significant than the 0.1% level 
of significance. The regression coefficient of y upon x can be calculated as before 

3.S 

_ S(x — x) (y — y) 

S(x — x)" 

= ;:; - ; r ; r o = 0.451 . 

908.8 

and the regression equation then is 


= ^+b(x-^)=0.451 x + 0.537 


(k) Correlation and Causation 

A significant correlation coefficient can be taken zs an indication of associa- 
tion between two variables, but it is important to realize chat this does not auto- 
matically imply causation. 

Thus for example suppose we have plant data stretching over a period of 
some months, and we find that the temperature Ti at a certain stage varies quite 
considerably. We may plot Ti against the dependent variable y, obtain what 
seems a linear relation, calculate the correlation coefficient and find it significant. 
At this stage it would be dangerous to assume that y is necessarily a function of 
T,, for it may be that the real operating variable is the temperature Tj in some 
other part of the process, and that it happens that Ti has been related to Tj 
through some common operating factor such as weather. 

This type of error is always liable to occur when our data is existing plant 
records which we observe. It is less likely to occur if the data is obtained by 
actual experiment. Thus in an experiment if we raise the temperatore Ti ^ 
chosen value, and the running of the plant as measured by the dependent vanable 
y (which may be yield, quality, etc.) improves, then for practical purposes it may 
be sufficient to say “run the plant with Tx at this value.” However, this still 
may not be real evidence of the causative effect of Tx upon y, for the plant may 
be such that in raising Tx to the new value we have at the same time affected 
the real causative variable T^. Thus, for example, in a counter-current ptoc^, 
raising the temperature at one point will probably affect the temperature at ot er 
points. 

The considerations outlined in the last three paragraphs do not in any way 
invalidate the desirability of the testing for significance of any apparent relation. 
Thev merely imply that having shown a correlation to be significant, caution is 
necessary in assuming that this association is evidence of a causative efiFect of one 
variable upon the other. 
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(1) Conclusions 

Where it is suspected that the relation between two variables is approximately 
linear, there are three statistics which between them summarise the most important 
properties of the available data : — 

(a) the correlation coefficient, with the associated number of degrees of 
freedom, which is a convenient measure of the degree of reliability of 
the association. 

(b) the regression coefficient b that measures the slope of the regression 
line, i.e. the average increase in y per unit increase in x, and 

(c) the residual variance ar that measures the scatter of values of y about 
the regression line, i.e. the reliability of estimates of y for given values 
of X when estimated from the regression line. 

(d) If the experiment has been executed with appropriate replication, it is 
often desirable for the analysis of variance giving the significance of 
departure from linearity to be applied. 

In- using a regression equation it must be always remembered that it is only 
valid over the range of the independent variable which occurred in the data used 
in calculating it. Extrapolation is most unwise, except when there is a very 
sound theoretical basis. It is good practice to quote the range used of the 
independent variable in order to discourage extrapolation. (®) 


CHAPTER X 

MULTIPLE CORRELATION 


(a) Introduction 

We discussed in Chapter IX the determination of the relationship between 
a dependent and an independent variable In many of the cases which arise in 
practice, however, we find that there is more than one independent variable which 
may be of practical importance. In these circumstances it becomes necessary to 
isolate the effect cf each independent variable from the effects of the other 
^independent variables. 

In the example we shall consider the stack loss from a Pressure Oxidation 
Plant for making nitric acid (about 55%— 60%) by the oxidation of ammonia 
with air. The resulting nitric gases are absorbed by being passed upwards through 
a bubble cap absorption column, nitric acid being circulated down the column. 
The .absorption is exothermic, i.e. gives rise to the liberation of heat, and the 
temperature tends to rise. The absorption of the nitric gases is less efficient at 
higher temperatures, so to keep the temperature down water is circulated through 
cooling coils in the column. The column also tends to cool itself by radiation, 
etc,, to its surroundings, A small quantity of the nitric gases fails to be absorbed 
and escapes by the stack:- this we will regard as the dependent variable. Its 
minimisation is, of course, of prime economic importance and we will first deter- 
mine its^ dependence upon the two independent variables air temperature, Xa, 
and cooling water temperature, Xw. In a subsequent section we will consider a 
third independent variable, strength of the nitric acid being circulated in the 
absorption column. 

(b) Two Independent Variables 

We will determine the dependence of the P.O.P. stack loss (to be denoted 
by y) upon the cooling water temperature, Xw, and the air temperature, Xa. 

(^) A full discussion of various aspects of correlation is to be found in Chapters 11 to 16 
of Yule and Kendall s An Introduction to the Theory of Statistics’* (Charles Griffin). 
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Accordingly data was collected giving correS(ponding values for these three 
variables for 139 days. 

It was assumed that the stack loss was to a first approximation related linearly 
to the two independent variables^ so that it could be represented by an equation 
of the form 

y == ay.wa bywa (Xw Xw) “{“ bya.w (Xa Xa) 

The meaning of the symbols is as follows : — 

ay.wa is the value y would have if Xa and Xw were held constant at their mean 
values. 

byw.a measures the rate of increase of y with increasing Xw, Xa being constant. 

bya.w measures the rate of increase of y with increasing Xa, Xw being constant. 

If we had considered it more appropriate, we could, of course have used an 
equation of the type 

y == ay.wa 4“ byw.a (Xw^ Xw^) “b bya.w (Xa^ Xa^) 

which would be based on the assumption that the stack loss y increased as the 
square of the temperature, or in the first of the above equations we could have 
substituted for the temperature the reciprocal of the absolute temperatures, or 
whatever functions we considered most likely to represent the true behaviour of 
the system. 

Taking the straightforward linear equation we will determine the values of 
these three coefficients which give the best general fit to the data. We proceed 
as follows. 

It makes the computation lighter, particularly if we are not using a calculating 
machine, to use arbitrary zeros and to multiply by 10 or 100, etc., if necessary, 
to remove the decimal points. Thus in the present instance the variables used 
are actually : — 

y = 10 (stack loss in gm/M® — 3.0). 

Xw = water temperature in °C — 20°C. 

Xa = air temperature in °F — 50°F. 

For each set of obser\'ations on the three variables, y, Xw, and Xa, we form the 
three pairs of products, yxw, XwXa, and Xay. Thus we have a table of 139 sets 
as below ; — 


Date 

y 

Xw 

Xa i 

XwXa 

Xay . 

xwy 

30.12.43 

—20 

— 6 

—11 

66 

220 

120 

29 

— 9 

1 

— 3 

- 3 

27 

— 9 

28 

—18 

1 

— 2 

— 2 

36 

—18 

27 

— 6 

- 1 

— 1 

1 

9 

6 

25 

— 3 

2 

— 3 

— 6 

6 

— 6 

22 

— 5 

— 5 

—10 

50. 

so 

25 

22 

—20 

2 

—20 

—40 

400 

—40 

21 

+ 19 

5 

+ 14 

70 i 

266 

+95 

18 

— 4 

— 1 

—14 

14 

56 

4 


1479 

406 

1 

3533 

17340 

12797 


We require the sums of the six columns as ^Bown in the last row of the table. 
We will denote a summation by th^ fetter SL Thus, e.g. S(xwXa) is the sum of 
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the 139 XwXa's, here 3533. We also require the three sums of the squares of the 
individual observations on the three variables. We thus have : — 


Sy = 1,479 

Dys 

= 99,077 

Sxw = 406 

SXw'^ 

= 3,456 

Sxa = 1 

SXa^ 

= 10,525 

Syxw = 12,797 

SyXa 

= 17,340 SxwXi = 3,533 


So far these sums of squares and sums of products refer to deviations from 
the arbitrary zeros. We need to convert these into deviations from the means 
with the usual type of correcting factors. Thus, e.g. : — 

S(y-vV= S(y2)_(^“ 


= 99.077 -^ 
= 83,340.0 

and S(y — y) (xw — Xw) == S(yxw) 


( Sy) ( Sxw) 
N 

1479 X 406 
139 


= 12 797 — ' ^ 

’ 139 

= 8,477.0 

In a similar manner we obtain the other terms as deviations from the means. 
In order to have our symbolism more compact we will denote a summation of 
deviations from the means by S'. Thus 

^(y — y) (xw — Xw) = S'yxw 

and S(y — yY = S'y^ 

We thus have 

S'y2 = 83,340.0 : S'xw"^ = 2270.1 : S'xa® = 10,525.0 
S'yxw = 8,477.0 : S'yxa = 17,329.4 : S'xwXa = 3,530.1 

To ‘obtain the two regression coefficients byw.a and bya.w we have to solve 
the two simultaneous equations 

2 yXw = bywa . S'Xw® H- hya‘W . S'XwXa 
S yXa = byw-a • S XwXa ~f“ b ya*w» S Xa^ 

Substituting our values, we obtain 

8,477 = byw.a . 2270 + bya.w . 3530 (A) 

17,329 = byw.a . 3530 -f- bya.w . 10525 (B) 

To solve these two simultaneous equations we can; e.g., multiolv f A) bv the 
ratio 10525/3530 = 2.9816 and obtain 

25,275 == byw.a . 6768 -j- byasw . 10525 (C) 

and subtracting (B) we obtain 

byw.a - 7946/3238 - 2.4540 

To obtain the other regression coefficient we substitute this value for bywa in 
say, (A) and find ’ ' 

8477 == 5570.6 + bya.w . 3530 

whence 

bya.w = 0.8234 

Our equation for estimating y is thus 

(y ~ y) = 2.4540 (xw — Xw) + 0.8234 (xa ~ Xa) 
y — 10.6403 = 2.4540 (xw — 2.9209) + 0.8234 (xa — 0.0072) 
or y = 3.4665 + 2.4540 Xw + 0.8234 Xa. 

!^fore proceeding, it is as well to test the significance of the two regression co- 
empents byw.a and bya.w. This we can do with an application of the analysis of 


efficients byw.a and bya.w. 
variance. 


^ We first want the total Sum of Squares of the dependent variable v 
IS obviously r'y2 = 83,340. 


68 



We then calculate the Sums of Squares of y explained by the two independent 
variables independently of each other. These are 
(S'yxw)* _ 8477» 

S'xw* " 2270 

= StS- = 28,531.5 
S'x.» 10,525 


: 31,656.2 


and 


for Xw and Xa respectively. We enter the larger of the two in the first line of the 
table below. 

We next calculate the sums of squares of y explained by the two independent 
variables together. This sum of squares is 

byw.a S'yxw + bya.w S'yxa 

= 2.4540 X 8477 + 0.8234 X 17,329 

== 35,071 ^ ^ , 

Then the sum of squares explained by the addition of the second mdependent 
variable Xa is the difference between the sum of squares explained by Xw and 
Xa together and the sum of squares explained by Xw alone, i.e. 

35,071.0- 31,656.2 = 3414.8 . 

We finally enter up these various sums of squares in a table of analysis ot variance 

as below. 

Table 10.1 


Source of Variance 

Sums of 
Squares 

Degrees of 
Freedom 

Mean 

Squares 

Variation explained by Xw 

31,656 

1 

31,656 

Increment explained by the 

3,415 

1 

3,415 

addition of Xa 

35,071 



Total explained by Xw and Xa 



together 

Residual 

48,269 

136 

354.9 

Total 

83,340 

138 



The degrees of freedom are such that each regression coefficient t^es 1 and 
the residual takes what is left. Similarly the Residual sum of squares is what k 
left out of the Total after the total explained by the two regression coefficients 
has been subtracted. 

To test for the significance of the second (smaller) regression term we com- 
pare its mean square with the residual. Here the variance ratio is 3415/354.9 
= 9.6 with degrees of freedom ni = 1, na = 136. This is more sigmfic^t th^ 
the 1% level of significance, so there can be little doubt about the reality of the 

effect of Xa. 1 r 4 

Accordingly, to estimate the value of y for any particular values of Xw and 

Xa we use the equation 

y == 3.466 + 2*-4S4 fi.823 xa \ . , , . - • u 

and th« vmafice d^f ^^y estimate is given by the Residual vm^ce in 

table of analysis of variance, namely 354.9. The standard deviation will be the 

square root of this, namely 18.8. r % x? 

A modification, due to Fisher, of the above procedure is often useful, hor 
generality we will use Xi for Xw and for xa. Thus we have 
S'v^ = 83340.0. S'yxi = 8477.0 

S'xi® = 2270.1. S'yxa == 17329.4 


S'x 2^ 10525.0. 


S'xiXa == 3530,1 


TO 
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We first solve the simultaneous equations 

bir(x,*) + b,r(xix,)-i (D) 

bxS'(xiX3) + b,SW) = 0 (E) 

and the solution of bi we denote by Caj and the solution of bg by c»a. The bs 
are, of course, the standard partial regression coefficients, and might be more 
completely defoed as in the previous treatment by byi.a and bya.x. In the present 
treatment we will understand the y and latter halves of the subscripts. Sub- 
stituting our values in equations (D) and (E), we get 

Cax . 2270.1 + Caa . 3530.1 = 1 (F) 

cai . 3530.1 + caa . 10525.0 = 0 (G) 

which can be solved by e.g. multiplying the first (F) by 10525.0/3530.1 = 2.981fi^ 
obtaining 

Cax . 6768.0 + Ca, . 10525.0 == 2.9816 
From this we subtract the second equation, (G), to obtain 
cai . 3238.0 = 2.9816 
and 

cai = 0.9208 X 10-« 

We can obtain Caa e.g. by substituting this value of Cai in the second equation as 
c.a = —0.3088 X 10-® 

We now require the solutions of 

bxS'(Xx*) +baS'(xxXa) = 0 (H) 

baS'(XxXa) + bar(Xa*) -1 (I) 

and we will denote the solutions bi and ba by cbx and Cba respectively. 

It will be noted that equations (H) and (I) are identical with (D) and (E) 
except that the right hand sides have been interchanged. As before, we can 
multiply the first equation (H) by 2.9816, obtaining. 

cbi . 6768.0 + Cba . 10525.0 = 0 
from which we subtract (I). 

We thus have 

Cbx = -—0.3088 X 10-« 
and substituting in either (H) or (I) we get 
Cba = 0.1986 X 10-* 

We now can calculate bi and b| as 

bx = Cax . S'(yxi) + CaaS'Cyxa) 

= 0.9208 X 10-* X 8477.0 — 0.3088 x 10-* X 17329.4 
= 2.454 

ba = Cbx S'(yxi) + CbaS'(yx2) 

= —0.3088 X 10-» X 8477.0 -f- 0.1986 x 10-» x 17329.4 
= 0.824 

These solutions of bi and ba are of course the same as obtained previously 
for byw.a and bya.w. It will be noted now where one of the principal advantages 
of this procedure lies. We can easily obtain the partial regression coefficients for 
a second dependent variable Y without the trouble of re-solving the simultaneous 
equations. Thus in the present instances we are only concerned with one 
dependent variable, the stack loss as combined nitrogen. We might in addition 
be interested in the ratio of nitrous to nitric gases and the variations of this ratio 
with the two independent variables. We merely need to obtain the sums of 
products of the new dependent vari ble Y with Xj, and x,, S'(Yxi) and ^'(Yxi) 
and substitute in equations (J) and (K). The additional labour is thus very slight, 
especially when compared with re-solving the simultaneous equations as with the 
earlier treatment. 
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The second advantage of the present procedure is that we can now readily 
obtain the standard error of each partial regression coefficient. We need the 
residual variance ofr® which we already have in Table 10.1. To repeat, in the 
present symbolism, it is 

P'(y)’ - S'(yxt) - b, S'(yx.)] 

= W9 ^ 


Then the standard error of bi is 

VZTi = V354^ VO.9208 x 10-* 

= 0.574 

and the standard error of b 2 is 

== VmO Vo. 1986 X 10“^ 

= 0.266 ^ 

These standard errors have degrees of freedom equal to (n — 3), and since 
n is generally large any regression coefficient twice its own standard error can be 
assiimed significant at the 5% level. This procedure is thus a method of testing 
significance alternative to that set out previously in Table 10.1. We can also set 
the probable limits (95%) for bi as 2.454 ±2 X 0.574 = 1.306 and 3.602. Putting 
it this way makes it clear how many significant figures are worth retaining in the 
final statement of the regression equation. 


(c) The Need for multiple Regression and Partial Correlatioii 


In deriving the regression coefficients byw.a and bya.w, we have kept the 
variable in the subscript after the full stop constant. For this reason these are 
known as “partial” regression coefficients. The simple regression coefficients 
would have been given by 


and 


_ S'yxw ^ 8^ 
“ S'xw* 2270 
_ S'yxa _ 17329 
“ S'xa* ““ 10525 


3.734 

1.646 


It will be noted that these simple regression coefficients are appreciably 
larger than the corresponding partial regression coefficients. The explanation of 
this is easy to see. If we calculate the correlation coefficient of Xw and Xa we get 

2' Xw Xa 3530 

^ V2'xw“ S'x.» “ V2270 X 10525 
= 0.7222 

There is thus a highly significant and very important correlation between Xw and 
Xa. This is a consequence of the nature of the process, of comse. Weather 
conditions that lead to low air temperatures (xa) will also tend to give low cooling 
water temperature (xw). 

Thus when we calculate the simple regression coefficient of y upon Xw, since 
the higher figures for Xw will tend to be associated with the higher figures for 
Xa, the values of y for the larger values of Xw will tend to be larger than they shoxild 
be because they are also associated with the larger values of Xa. The partial 
regression coefficients, however, have the effect of the other variable eliminated, 
and measure only the effect of the variable specified. 
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In this connection the partial correlation coefficients are sometimes used. If 
the simple correlation coefficients are 
rwa = 0.7222 

S'yxw 8477 

Tyw = 


Tya = 


VS'y* X S'xw® 
S'yxa _ 


•v/83,340 X 2270 
17,329 


= 0.6165 


V2'y“X S'xa* V83,340 X 10,525 
then the partial correlation coelEcients are 
ryw — rya . rwa 

y'(l — (1 — j-wa*) 

0.6165 — 0.5850 X 0.7222 


= 0.5850 


and 


V(l- 

■ 0.58502) (1 

— 0.7222^) 

= 0.3458 



rya 

— ryw . raw 

~ v(r= 

ryw^) (1 — 

raw^) 

0.5850 — 0.6165 

X 0.7222 

\/(i— 

0.61652) (1 

— 0.72222) 

= 0.2567 




For testing the significance of a simple correlation coefficient the degrees of 
freedom are (n — 2), if n is the number of observations. For a partial correlation 
coefficient, another degree of freedom is lost, and the number is thus (n — 3). 
In the present case, though both the correlation coefficients are markedly reduced 
in their transformation from simple to partial coefficients, they still remain signi- 
ficant with this large number of degrees of freedom (139 — 3). In general, 
however, it is possible for a positive simple correlation coefficient to be transformed 
into a negative partial correlation coefficient, if, to quote the formula for rya.w, the 
product of ryw and raw is greater than rya. 

While the difference between the simple and partial coefficients was in this 
c^e of interest but not of immediate practical importance, it might well be other- 
wise in other cases. Thus suppose it had been decided that it would be economic 
to instal refrigeration equipment on the basis of the simple regression coefficient 
of 0.373. If the refrigeration would lower the mean cooling water temperature 
by 10°C., we could expect a reduction in the stack loss of 3.73 units on the simple 
correlation, and this might be sufficient to justify the installation. The reduction 
which would be actually achieved is 2.44 units (from the partial regression coeffi- 
cient), and the difference between this and 3.73 might be the difference between 
an economic and an xmeconomic. installation. 


(d) Multiple Correlation with Three Independent Variables 

In the ex^ple we have been considering, the loss from a P.O.P. stack, there 
is a third variable of considerable interest, namely the strength of acid in the 
absorption tower, which we will denote by Xs. 

We are setting out to determine the coefficients (a and the bs) in the equation 

y = ay.w^ +,b yw.as Xw “f" bya.ws Xa bys.aw Xs 

where the symbolism is as before : thus 

ay.was is the value y would have if Xw, x., and x. were held constant at their 
mean values. 

byw.a8 measures the rate of increase of y with increasing Xw, both Xa and x* 
being constant at their mean values. 

Similarly for bya.ws and by8.aw. 
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We require the sums, the sums of squares, and the sums of products with 
the other three variables. As it is desirable to retain rather more decimal places, 
the complete set with the earlier ones recalculated are below : — 


Sy 

= 

1479 

Sy^ 

= 

Sxw 

= 

406 

Xxw* 


SXa 

= 

1 

SXa'^ 

== 

SXs 


70 

Sxs^ 

=: 

Syxw 

= 

12797 

2'yxw 

S'yxa 

= 

SyXa 

= 

17340 

= 

Syxs 

= 

12931 

X'yxs 

S'XwXa 

= 

EXwXa 

r= 

3533 


XXaXs 

= 

5365 

S'XaXs 

= 

SXsX\^ 

= 

1943 

S'XsXw 



99077 

-zy = 83340.014 

3456 

■ S'xw’* = 2270.12950 

10525 

S'xa^ = 10524.9928 

36364 

= 36328.74820 

8477.0431 


17329.3597 


12186.1799 


3530.07914 


5364.49640 


1738.5396 



We now need to solve the three sets of three simultaneous equations (A),. 
(B) and (C) below. The first set has as right hand side for (A), (B) and (C) 
1, 0 and 0 respectively, and the solutions are cai, Cbi, and Cci. The second set 
has as right hand sides 0, 1 and 0 and solutions Ca 2 , Cb 2 , and Cc 2 - Similarly the 
third set has right hand sides 0, 0, 1 and solutions Cig, cba, and Cca- 

Ca 2'xw^ + Cb S'XwXs + Cc S'XwXa = 1, 0, 0 (A) 

Ca S'XaXw + Cb S'Xs® + Cc S'XgXa = 0, 1, 0 (B) 

Ca S^XaXw -j- Cb S^XaXa -|- Cc S^Xa^ = 0, 0, 1 (C) 


The solutions of these equations can be obtained empirically, but a systematic 
method is less liable to accidental errors. 

Whittaker and Robinson(^) contain a general discussion, but the most satis- 
factory method is due to M. H. Doolittle. We set out the equations as (A'), 
(B') and (C') in a table as Table 10.2. Inserting the numerical values quoted 
above, we get (A"'), (B"), (C")- It will be understood that all the figures occurrmg 
in the column headed Ca are effectively the coefficients of Ca in the succ^ive 
equations, and similarly for Cb and Cc. The right hand sides have been multiplied 
by 10* temporarily to avoid the occurrence of excessively large numbers of O’s 
after the decimal points. 

The next step is to write down again (A") as (A'"), and then put underneath 
it the result of dividing it by the coefficient of Ca with the sign changed (here by 
— ^2270.12950). This gives us (D). 

Now set down (B") again as (B'"). Multiply (A"0 by the coefficient of cb 
in (D) (here —0.76583279) ; this is (E). Add to (B'") giving (F). Divide (F) 
by the coefficient of its first term with the sign changed (i.e. by — 34997.3536) 
to give (G). 

Now set down (C") again as (C'"). Multiply (A'") by the coefficient of 
Cc in (D) (i.e. by —1.55501223) to give (H). Also multiply (F) by the coefficient 
of Cc in (G) to give (I). Adding (C'"), (H) and (I) we get J, and dividing throt^ 
by the coefficient of Cc (4833.3424) we get (K). The three figuk_on the right 
hand side are the solutions for Cci, Cc 2 > and Cca, all multiplied by 10*. 

If we now take (G), and substitute these solutions for Cci, Ccj, and Ccg we 
get (Li), (La) and (Lg) respectively. These lead immediately to (L'l), (L\), and 
(L'g), which give the solutions for cbi, cbg, and Cbg, respectively. 

We now substitute the solutions for cb and Cc in (D )2 getting (Mj), (Mg) and 
(Mg), leading to (M'l), (M'a) and (M'g) as the solutions ofcai,Caa, and Cag respectively. 

We can summarise these solutions in a matrix as in Table 10.3. 

C^) E. T. Whittaker and G. Robinson, “The Calculus of Observations*’ (Blackie), 
3rd edition, Chapter IX. 
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Table 10.2 



C* 

Cb 

Cc 

(A'). 

(B') 

(C') 

S'Xw* 

S'XgXw 

S XaXw 

S'XwX, 

S'x.» 

L'XaX, 

• S'XwX, 

I S'x,x« 

S'x.* 

A" 

2270.12950 

1738.5396 

3530.07914 

B" 

1738.5396 

36328.7482 

5364.4960 

C" 

3530.07914 

5364.4960 

1 

10524.9928 

A'" 

2270.12950 

1738.5396 

3530.07914 

D 

—1.00000 

i —0.76583279 

—1.55501223 


1738.5396 

36328.7842 

5364.4960 

E 

—1738.5396 

—1331.4306 

—2703.4503 

F 

0.0000 

34997.3536 

2661.0457 

G 


—1.0000 

—0.076035630 

C" 

3530.07914 

5364.4960 

10524.9928 

(H) 

—3530.07904 

—2703.4502 

—5489.3161 

(I) 


—2661.0458 

—202.3343 

J 



■ 4833.3424 

K 



1.00000 

Lx 


—1.0000 

0.2354659 

L* 


—1.0000 

0.0119615 

L. 


—1.0000 

—0.1573148 

L'l 


1.0000 


L', 


1.0000 


L'. 


1.0000 


Ml 

—1.0000 

—0.0127435 

4.8155366 

M* 

—1.0000 

—0.2279864 

0.2446261 

M, 

—1.000 

0.1204768 

—3.2172607 

M'x 

1.0000 



M', 

1.0000 



M', 

1.0000 
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Table 10.2 — contd. 



First Set 

Second Set 

Third Set 


X 10 * 

X 10 « 

X 10 « 

(A') 

10,000 

0 

0 

(B') 

0 

10,000 

0 

(C') 

0 

0 

10,000 

A" 

10,000 

0 

0 

B" 

0 

10,000 

0 

C" 

0 

0 

10,000 

A'" 

10,000 

0 

0 

D 

— 4.4050351 

0 

0 


0 

10,000 

0 

E 

— 7658.3279 

0 

0 

F 

— 7658.3279 

10,000 

0 

G 

0.218825915 

— 0.28573589 

0 

C"' 

0 

0 

10,000 

(H) 

— 15550.1223 

0 

0 

(I) 

582.3058 

— 760.35630 

0 

J 

— 14967.8165 

— 760.35630 

10,000 

K 

— 3.0967838 

— 0.1573148 

2.0689617 

Lx 

0.2188259 



Ls 


— 0.2857359 


L 3 



. 0.0 

L'x 

0.0166400 



L'x 


0.2976974 


L's 



— 0.1573148 

Mx 

— 4.4050351 



Ms 


0 


M» 



0 

M'l 

9.2078282 



M's 


0.0166397 


M's 



— 3.0967839 
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Table 10.3 

All solutions multiplied by 10* 



a 

b 

c 

1 

9.2078282 

0.0166400 

—3.0967838 

2 

0.0166397 

0.2976974 

—0.1573148 

3 

—3.0967839 

—0.1573148 

2.0689617 


Like the original equations, this matrix is axi-symmetric ; Cbi == Caa^ 

Ccj ' — and Cc 2 Cbs* ■ . , * , ^ , 

A good check on the correctness of the solutions is to t^e the set tor c* and 
substitute them in (C"). There we get for the left hand side : 

(3530.07914 X 9.2078282 + 5364.4960 X 0.01663970 — 10524.9928 X 
3.0967839) X 10-* = 0.00000025 
which is close enough to the expected value of 0. 

We now have as our values for the regression coefficients 

bw.as “ Cai S^(y Xw) “h ^*2 ^ (y ^*) “h *-*3 ^ (y ^*) 

= [9.2078282 x 8477.0431 + 0.0166397 x 12186.1799 
— 3.0967839 x 17329.3597] X 10-* 

= [78055.1564 + 202.7744 — 53665.2795] X 10-‘ 

= 2.4592651 


bvw = cbi S'(y Xw) + cbj S'(y X.) + cba S'(y X.) 

= 0.1042687 

bs.nr = S'(y Xw) + Cca S'(y X,) + Cc 3 S'(y x«) 

= 0.7685143 

A useful check on the accuracy of these last proceedings is to substitute these 
values in the equation 

bw.as 2’x»Xw “I" bs.aw S’XaXs -(- ba.ws S Xa^ = S y Xa 
Here we get 17329.3566 = 17329.3597, a discrepancy of 0.0031 which is 
small enough to be ignored. 

The residual variance is given by 

Cr^ = ^ ^ P'y^ — bw.aa S’yXw — bs.aw 2 yXa ba.aw ^ 


= A [83340.014 — 2.4592651 X 8477.0431 — 0.1042687 X 
12186.1799 — 0.7685143 X 17329.3597] 

= 47904.221/135 = 354.846 

The standard errors of the three regression coefficients are then given by 

= crV^ = V354.847 X 9.2078 x 10-* 

= 0.5716 

«ys.aw = CTrVcbjj = 0.1028 

<Ta.sw — Cb3 = 0.2709 

To test the significance of each coefficient, we obtain the value of t as the ratio 
of the regression coefficient to its standard error, and look up this value of t with 
degrees of freedom equal to those of the residual variance. For ba.8w, for example, 
t = 0.7685143/0.2709 == 2.837 with 135 degrees of freedom. Reference to 
Table I in the Appendix shows that this corresponds to a level of significance of 
between 1% and 0.1%. 
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Confidence limits for each estimate can be derived in the usual way. For 
ba.sw and 95% limits these are 0.768514 ± 1.98 x 0.2709, or between 0.232132 
and 1.304896. 

Incidentally, it will be obvious that we have retained an excessive number 
of significant figures, and we could comfortably discard the last 4. It is much 
better, however, to carry out this discarding at the very end of the process rather 
than at the beginning. 

The final regression equation is 
y — y — b-vf.as (Xw Xw) “f" ba^w (Xg Xg) -f- b^.sw (Xa — Xi) 

y — 1479/139 = 2.459265 (xw — 406/139) + 0.104269 (x. — 70/139) 

+ 0.768514 (xa— 1/139) 
or y = 2.459 Xw + 0.104 x* + 0,768 Xa + 3.399 

(e) Conclusions 

The use of this multiple regression technique involves rather heavy arith- 
metic, but when we have a mass of data with several independent variables it is 
the only sound way of assessing their several effects. 

Thus in the present instance, we were particularly interested in the inde- 
pendent variable with the smallest regression coefficient, Xs. Completely elim- 
inating the effects of the other two independent variables we obtained an estimate 
of the regression coefficient and were able to test its significance against a residual 
of 354.8, against which it was quite non-significant. If we had ignored the other 
two variables we would have obtained the sum of squares due to Xs as 4088, with 
a residual of 79252/137=578.5. Tested against this Xs would appear significant 
at the 1% level, an entirely erroneous conclusion. 

The second reason for using the multiple regression technique was discussed 
at length earlier with reference to the misleading results obtained when using 
simple regression when the independent variables happen to be correlated. In 
this case the independent variables both had positive regression coefficients and 
were positively correlated, and so gave fictitiously large values for both the 
regression coefficients. If, however, they were negatively correlated, we would 
obtain fictitiously small values for the regression coefficients using the simple 
regression technique. Of course, if the independent variables are not correlated, 
then this phenomenon will not occur, and the only loss through using simple 
regression instead of partial (or multiple) regression is the loss of precision in 
our estimate through the residual being much larger. 

As with simple regression equations, extrapolation of a multiple regression 
equation is unwise unless there is a very sound theoretical basis. It is good 
practice to quote the ranges of the independent variables that have occurred in 
the data used for calculating the regression equation, in order to discourage 
extrapolation. 


CHAPTER XI 

THE GENERAL ANALYSIS OF VARIANCE 
(a) Introduction 

It is important to realize that a situation suitable for the analysis of variance 
may arise in several ways : — 

(i) By accident : the process may have a suitable structure as it stands. An 
example of this is discussed in Section (i) of this chapter, where there are three 
factories each handling three types of weak acid. The statistician had no part 
in seeing that there were three factories for each t 3 rpe of weak acid : this happened 
quite fortuitously. 
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(ii) By slight modification of existing manufacturing procedure. Thus it 
may be that in a process to each unit is added a variable amount of rework material 
of uncertain history, and for the purpose of examination this practice could be 
discontinued temporarily. 

(iii) By deliberate experiment. Thus several factors may be varied at the 
levels appropriate to a factorial design. 

(b) Types of Analysis 

One of the difficulties frequently experienced in the use of the analysis of 
variance is to decide which is the appropriate form of analysis. A further diffi- 
culty is deciding the procedure for tests of significance. An attempt will be made 
here to give an outline of a general treatment. 

In the succeeding examples, we shall deal with a hypothetical polymer being 
processed into fibres and the dependent variable is the fibre tensile strength. 
We shall make various modifications to this basic idea in an attempt to show 
how the various forms of analysis of variances are related to each other. After 
each form of this general example we shall deal in detail with another example 
of similar structure, A number of aspects of the use of the analysis of variance 
follow in Chapter XII. 

All analyses can be divided into two broad types : — 

(i) Those in which categories preserve their one-to-one correspondence : 
these we can define as pure factorial analyses. 

(ii) Those in which certain of the categories do not preserve their one-to-one 
correspondence. These we can define as incomplete analyses. 

The meaning of these two classifications will become clear with detailed 
discussion. 

(c) The Two Factor Analysis 

We can frequently categorise a set of individuals in two ways : thus we can 
have the raw material classified into a number of grades g according to the nominal 
degree of polymerisation. There are available a number of batches of each grade, 
and one batch of each grade can be processed at each of t different temperatures. 
We then have two factors : grade of pol 3 rmer with g levels and temperature of 
processing with t levels. If we specify the values of g and t for any individual, 
we have completely identified that individual. 

Such an example was discussed in Chapter VII (e). Table 11.1 below gives 
the degrees of freedom and components of variance. 

Table 11.1 


Source of 
Variance 

Degrees of 
Freedom 

Components 
of Variance 

G 

T 

Residual 

g— 1 

t— 1 

(g^l)(t_l) 

tag* -f" 
gat* + ao* 
ao* 

Total 

gt— 1 

• 


The tests of significance are obvious. If factor G is to exist, i.e. if erg* is to 
be greater than zero, then we must have : — 
tag* + ao* must be greater than o©* 

and we can readily check the significance of this with the variance ratio test 
Factor T can be tested similarly. 
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(d) The Three Factor Analysis 

(i) In the previous section, we had g grades of polymer, one batch of each 
grade being processed at each of t different temperatures. Suppose now tlmt 
each of s supplier supplies g grades of polymer and from each grade of each supplier 
t batches are processed at each of t temperatures. 

It is important that the first grade from Supplier 1 has the same nominal 
degree of polymerisation as the first grade from Supplier 2, and as the first grade 
from Supplier 3, etc. Thus, the first grades for all the suppliers have a certain 
common property which distinguishes diem in some way from the second grades 

of all the suppliers, etc. ^ i j • j 

There are now gts individuals, and each can be umquely detenmned by 
specifying particular values of g, t, and s. We thus have three factors : suppliers, 
grades of polymer and temperature of processing. The technique of the analysis 
of variance for three factors is given later. The degrees of freedom and the 
components of variance are reproduced in Table 11.2. 

Table 11.2 


Source of 
Variance 

Degrees of 
Freedom 

Components of Variance 

G 

T 

S 

G X T 

T X S 

S X G 
Residual 

g— 1 

t— 1 
s — 1 

(g-l)(t-l) 
(t-1) (s-1) 
(s-l)(g-l) 
(g_l)(t_l)(s-l) 

a©* + sagt* + tOgs* + tsag^ 

Oo® + gcJts* + scTgt* + sgat* 

<To® -j- tdsg® + gOst* + gtCJs* 
do* “1“ Sdgt* 
do* + gGf*t* 

<Jo* + tOsg* 
do* 

Total 

gts — 1 



For the tests of significance we must start with the smallest interaction and 
test it against the residual. Thus, if the smallest interaction is, say, S X G, we 
test with the variance ratio test (eo* + against a©*. If it is not sigmfican^ 
we pool it with the residual and obtain a new estimate of c©* ; similarly, with 
the other interactions, taking them in ascending order of magmtude. 

For the testing of the main effects, ^ere are a nimiber of possibihties 
(i) All three interactions are non-significant. Striking Aem out in the table 
of components of variance in Table 11.2, and pooling their degrees of freedom 
v^dth the residual, we obtdn Table 11,3, : — 


L Table 11.3 


Source of 
Varismee 

Degrees of 
Freedom 

Components 
of Variance 

G 

g — l 

a©* + tsdg* 

T 

t-1 

Oo* + 8g®t* 

S 

8 — 1 

Oq* + g<w«* 

Residual 

gtB ^ — (g + t + e) + 2 


Total 

gts — 1 
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For the three main effects the tests of significance are quite obviously to. 
compare (co» + ts<T*“) with for the factor G, and similarly for T and b. 

(ii) One interaction, say, G X T, is significant, _ and the others are not. 
Striking the latter out in Table 11.2, we thereby obtain Table 11.4. 

Table 11.4 


Source of 
Variance 

Degrees of 
Freedom 

Components of 
Variance 

G 

T 
^ S 

. G X T 
Residual 

g-i 

t — 1 
s 1 

(g_l(t_l) 

(gt — 1) (s — 1) 

<yo® + sagt® + tscTg® 

Co® + SCTgt® + SgCTt® 

CTo® + gtCFs® 

Co® “f" SCTgt® 
ao® 

Total 

gts — 1 



To test for the existence of the S main effect, we need to demonstrate the 
existence of cs®, and this can be rigorously done by testing (oo® + gta^^) against 
<jo®. For the G main effect, we need to demonstrate the existence of ctg*, and to 
do this, we must show that 

Co® + scgt^ + tsag2 is greater than Co® + ^ ^ 

i.e. we test the G main effect against the G X T interaction. The T main effect 
is tested similarly. 

(iii) The thiid possibility is that two interactions are significant, e.g. G X T 
and T X S. Table 11.2 then becomes Table 11.5 : 

Table 11.5 


Source of 
Variance 

Degrees of 
Freedom 

Components of 

Variance 

G 

T 

S 

G X T 

T X S 
Residual 

g— 1 

t— 1 
s 1 

(g_l)(t-l) 

(t-l(s-l) 
t(g — 1) (s — 1) 

Co® “1“ scjgt® "H ts<jg® 

ao® + gats® + sagt® + sgat® 

ao® + gcfst® 4" gtcTs® 

Co® + sagt® 
ao® -{“ tast* 
ao® 

Total 

gts — 1 



Studying the Components of Variance, it will be apparent that the significance 
of the G and S main effects can be established by testing their mean squares 
against the G X T and the T X S interactions respectively. However, there is 
no rigorous test to demonstrate the existence of the T main effect, for while we 
might prove that it was greater than the G X T interaction, this might be due 
to the existence of cts® as much as to the existence of at^. 

In practice this impasse is not of any great practical importance, as we would 
break down the analysis into separate two factor G X S analyses for each level 
of T. This procedure is illustrated in Section (e) of this chapter. 

80 





Let us consider the data in the table below, referring to a purification process. 
The figures (the dependent variable x) refer to the purity of a product measured 
in a certain way. The factor T at three levels represented time of boiling. The 
factor A represents solvent used, and Wj represents a cold final wash of the product 
and W2 a hot final wash. 



Ti 

T, 

T, 

Ai Wi 

3.1 

2.0 

1.3 

Ws 

2.1 

1.6 

1.9 

A, 

4.3 

2.8 

4.1 

W, 

6.1 

3.9 

2.6 


In analysing this data, an arbitrary zero of 3.0 was taken. This reduces 
the magnitude of all the numbers, and therefore their squares, and the arithmetic 
becomes less burdensome. Also, to avoid confusion with the decimal point, the 
numbers were then multiplied by 10 . This results in the sums of squares in the 
analysis of variance being 10® = 100 times too large, which does not affect the 
analysis, as in the tests of significance all the terms are compared with each other, 
the change in scale thus being of no importance. If we proceed to calculate any 
particular variance, however, we must correct for this change of scales by dividing 
.by 100. 

The result of the transformation is as below : 



Ti 

T, 

T, 

Ai 

1 

—10 

—17 

w. 

—9 

—14 

— 11 

A, Wi 

13 

—2 

11 

w. 

31 

9 



^ It should be remembered that the data is completely symmetrical with respect 
to the three variables, A, W, and T. This might be clearer if we were able, 
using three-dimensional paper, to set the lower half of the table (for Aj) on top 
of the upper half of the table (for Ai). Then the horizontal axis across the page 
represents difference between times of boiling (T), the vertical axis down the page 
represents type of wash (W), and the axis perpendicular to the page represents 
type of solvent (A). The table as it stands has the fonn of tables for W and T for 
each value of A : that is an arbitrary choice and it might equally well have been 
W and A tables for each value of T, or A and T tables for each value of W. 

We first form the three two-way tables by summing over each of the three 
variables in turn. Thus we sum over W for each value of A and T, getting the 
table below : 




T, 

T3 

Totals 

Ai 

• —8 

—24 

—28 

—60 

A, 

44 

7 

7 

58 

Totals 

36 

—17 

—21 

—2 
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Summing over A for each value of T and W we get the table below 



T, 

T. 

T, 

Totals 

Wi 

14 

—12 

—6 

—4 

W, 

22 

—5 

—15 

+2 

Totals 

36 

—17 

—21 

—2 


The third table is obtained by summing over T for each value of A and 
as below : 




A, 

Totals 

Wi 

—26 

22 

—4 

w. 

—34 

36 

2 

Totals 

—60 

58 

—2 


We analyse the first of these two-way tables, for A and T, as follows : 

Form the following terms — 

(1) Square the individuals, sum the squares, and divide by the number of 
origind individuals that have been summed to give the individuals in this two-way 
table. Here each individual is the sum of two original individuals. Thus we have 

[(^8)* + (— 24)» + (—28)* 4- 44* + 7» + 7*]/2 = 1729 

(2) Obtain the total for each row, square these, totals, sum these ’ squares, 
and divide this total by the number of original individuals that have been summed 
to give the row totals. Here each row total is the sum of 6 original individuals 
(the row totals are each the sum of 3 two-way table individuals, each of which is 
the sum of 2 original individuals). Thus we have 

[(—60)* + (58)^/6 = 1160.67 

(3) Perform the identical operation for the column totals. Here each column 
total is the sum of 4 original individuals, so the divisor is 4. Thus we have 

[36* + (—17)* + (— 21)*]/4 = 506.5 

(4) Obtain the grand total for all individuals, square this grand total, and 
divide by the grand total number of original individuds 

(— 2)*/12 = 0.33 

It will be noted that these operations are similar to the analysis of a row and 
column table, discussed earlier, differing only in that each sum of squares is 
divided, not by the number of individuals in the two-way table in each total 
which is being squared, but by the number of original individuals in each total 
that is being squared. 
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We can now form a table for the analysis of variance of this row and column table. 


Source of Variance 

Degrees of 
Freedom 

Sums of Squares 

Mean 

Squares 

Between A 

a — 1 = 2— 1 
= 1 

(2) — (4) = 1160.33 

1160.33 

Between T 

t — 1 = 3 — 1 
= 2 

(3) — (4)= 506.17 

253.08 

A X T interaction 

(a_l)(t_l) 

= 2 

62.17 

31.08 

Total 

at — 1 = 5 

(1) — (4) = 1728.67 



a = number of levels of A 
t = number of levels of T 

This table corresponds closely to that previously discussed for row and column 
analysis, except that what was formerly “Residual” is now termed “A X T 
interaction.” The reason for this will be discussed later. The Sum of Squares 
corresponding to this term is best obtained as the difference between the sum 
of the A and T terms and the Total, i.e. 

1728.67 — (1160.33 + 506.17) = 62.17 
Precisely similar operations are performed on the other two-way tables. 
Finally, the Total Variance is obtained by squaring aU the original indi- 
viduals, summing these squares, and subtracting from this sum of squares the 
total squared divided by the total number of original individuals, i.e. 

+ (-10)» + (— 17)‘ + (—9)* + . . . (-4)* - = 2139.67 
All these results are now gathered together in Table 11.6. 


Table 11.6 


Source of Variance 

Degrees of Freedom 

Sums of 
Squares 

Mean 

Squares 

Components of Variance 

Between A 

(a — 1) - I 

1160.33 

1160.33 

wta| + woJj + t(j^^ +®o 

Between W 

(w — 1) = I 

3.00 

3.00 


Between T 

(t™l)»2 

506.17 

253-08 

wao^t + 

A X W Interaction 

(a — 1) (w -- i) » I 

40.33 

40.33 1 

+ "0 

W X T Interaction 

(w — i) (t — i) » a 

45-50 

22.75 


T X A Interaction 

(t — I) (a — i) =2 

62 17 

31.08 


Kesidual 

(a— Oft — i> 

(W l) = 2 

322.17 

161.08 

+ <^0 

Total 1 

awt — I »= 1 1 

2139.67 




a, w, and t are. respectively equal to the number of levels of A, W, and T. 
da, <jw and o/ are respectively equal to the variances due to general differences 
between levels of A, levels of W, and levels of T. 

a4, city are respectively equal to the variances due to interactions between 
A and W, between W and T, and between T and A. 

is the residual variance, unattributable to any of the above terms. 

The sum of squares corresponding to the residual is obtained as the difference 
between the total sum of squares and the sum of the six other terms. 
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The meaning of these terms is as follows : 

of is the variability that would occur if the other two factors W and T were 
held constant, similarly <t;. is the variability that would occur if A and T were held 
constant and a; the variability that would occur if A and W were^ held constant. 

The A X W Interaction is a measure of the extent to which the A effect 
depends on the value of W, and conversely, to which the W effect depends on the 
value of A. Similarly for the other interactions. 

The Residual is that portion of the variability that cannot be allocated 
to the other six terms. It can be regarded as a measure of error. In the present 
experimental design, the possible second order interaction A X W X T, if it 
exists, will contribute to the same term. The second order A X W X T measures 
the extent to which the A X W interaction depends upon the level of T, or 
similarly, to which the A X T interaction depends on the level of W, similarly, 
to which the T X W interaction depends on the level of A. If this effect does 
exist, it will give us an inflated estimate of error, and hence the effects will be 
likely to not attain significance. This situation is discussed later in Chapter XII (e). 
For the present we will assume that the A X W X T interaction does not exist, 
or at any rate is not large. 

We now need to apply a test of significance for the existence of these terms. 
We can see that if, for example is to exist, then (w< + cl) must be signi- 
ficantly greater than a;, i.e. the T X A interaction mean square must be signi- 
ficantly greater than the residual mean square. 

Here we see that the Mean Squares for the W, A X W, W X T, and T X A 
are all less than the Residual Mean Square, so that cr“, and a,i do not 

exist, i.e. are zero. We can therefore use these mean squares as estimates of ci 
to get a more accurate estimate. We pool their sums of squares and the degrees 
of freedom, i.e. 

(3.00 + 40.33 + 45.50 + 62.17 -h 322.17)/(1 + l-f 2 -f 2 4-2) - 59.1 
This new residual has 8 degrees of freedom. 

Considering the T term, the T mean Square now estimates wa c; + al 
for Cat and c^t have been shown to be zero. To establish the existence of a; we 
therefore test the T mean square against the (new) residual. The variance ratio 
is 253.08/59.1 = 4.3 for degrees of freedom ni = 2, ng = 8. This lies close to 
the 5% level of significance, so we may assume that T does have an effect. 

Similarly the A term gives a variance ratio of 1160.33-/59.1 = 19.6 for degrees 
of freedom ni 1, ng ~ 8, this being considerably more significant than the 
1 % level, so we assume that A does have an effect. 

We now can take the means for each level of A and T, and conclude that 
for Al we have, translating back into terms of the original variable, a mean value 
2.00 and for Ag a mean value of 3.97. For Ti the mean is 3.90, for Tg 2.58, and 
for Ta 2.48. It is clear that the difference between Tg and Ts is so slight as to 
be not significant, but the difference between these two and Tj is quite consider- 
able. We can also conclude that the difference between the two levels of W is 
quite insignificant. 

Each of the averages for the three levels of T is the average of 4 results. In 
the units in which the analysis of variance was carried out the residual variance 
was 59.15, corresponding to a standard deviation of 7.691, or converting to original 
units 0.7691. Accordingly the standard deviation of the average of four results 
will be 0.7691 /V4 = 0.3845. This then is the standard deviation of each of the 
three averages quoted above for the three levels of T. The standard deviation 
of the difference between any two is 0.3845 V2 = 0,5429. 

Proceeding similarly for the two averages for the two levels of A, these each 
have standard deviation of 0.7691 /V6’= 0.3140, and the standard error of the 
difference between these two averages, which is 3,97 — 2.00 = 1.97, is 0.3140 V2 
— 0.4440. The residual on which this standard error is based has 8 degrees of 
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freedom, for which for the 5% level of significance t = 2.31 (see Table I, 
Appendix). Accordingly the 95% confidence limits for the difference between 
Ai and are 1.97 dz 2.31 X 0.444, or 3.00 and 0.94. 

In this example, having demonstrated the significance of certain effects, we 
have proceeded to calculate the means for each level of these effects in order to 
demonstrate their mode of action. In this example each factor and its levels 
had precise and obvious meanings, and when we compared Aj with A 2 we knew 
what comparison we were making (we were comparing one solvent with another). 
In some types of analysis, however, the difference between different levels of a 
particular factor may not be obvious. Thus a factor may be shift, with 3 levels. 
Or it may be day, with any number of levels. Or it may be machine for a group 
of allegedly similar machines. With this type of factor, where the distinction between 
the levels is rather vague and elusive, it is often better to calculate the actual 
magnitude of the variance due to each term. 

(e) The Four Factor Analysis 

fi) In the previous section, we had g grades of polymer by each of s suppliers 
being processed at each of t temperatures. If we suppose that the suppliers were 
supplied with raw material from each of r sources, then R becomes a fourth main 
factor. There are thus gtsr individuals, and each can be . uniquely determined 
by specifying particular of values of the categories. It is important that every 
supplier draws from each source of raw material. 

The components of variance are reproduced in Table 11.8 below, along with 
the respective degrees of freedom. 

Table 11.8 


Source of 

Y ariance 

Degrees of 

Freedom 

Components of 

Variance 

G 

g - 1 

rcgts® -j- tasrg® "f" scyrgt* 

+ sragt® + tsarg® + rtergs® 

+ tsrcTg^ 

T 

t--- 1 

(Jo® + rogts® gutsr® + ScJrgt* 

+ srcjgt® + rgCJst® 4- gs^rt® 

+ gsrat® 

S 

s— 1 

(Jo® 4“ rcTgts® ~|~ 4“ ta®srg 

+ rgats® + gtcsr® + rtags* 



+gtras® 

R 

r — 1 

(Jo® + gotsr® + Sa*rgt “b t<T®srg 

+ gta^sr + tsa^rg + gscy®rt 
+ sgtar® 

G X T 

(g-l)(t-l) 

do® + Sdrgt® + ra®gts + sra®gt 

T X S 

(t-l)(s-l) 

do® + rd®gts + gdhsr + rgd®ts 

S X R 

(s-l)(r-l) 

do® + td®srg + gtd®sr 

R X G 

(i--l(g-l) 

do® “h td®t>rg -|” Sd®rgt -f- tSd®rg 

G X S 

(g-l)(s~l) 

do® + rd^gts + ta®srg + rtd®gi. 

T X R 

'(t-l)(r-l) 

do® +• gd®tsr + Sd®rgt + gSd®rt 

G X T X S 

(g~l)(t-l)(s-l) 

(Jo® + rd®gts 

T x S X R 

(t-l)(s-l)(r-l) 

do® + gc^^tsr 

S X R X G 

(s— 1) (r— 1) (g— 1) 

Go® + td Vg 

R X G X T 

(r_l)(g_l)(t-l) 

Go® + Sd^rgt 

Residual 

(g_l)(t_l)(s-l)(r-l) 

do® 

Total 

gtsr — 1 



The appropriate tests of significance can be worked out from the components 


of variance. 
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(ii) The data below is from a four factor experiment, each factor at two levels. 
The experiment is on a purification process, and high figures for the dependent 
variable correspond to an inferior product. The treatments were such that the 
crude material could be given a hot or cold wash (Hi or Hj), then given a period 
of boiling or not given a period of boiling (Bg or Bj), then dissolved, filtered, and 
precipitated from either of two solvents (Si or S 2 ), and finally given a cold or hot 
wash (Wi or Wg). 


Si 

s, 

Wi 

W2 

Wi- 

W2 




B^ 

Bi 

Bj 

Bi 

B, 1 

Hi 

H2 

Hi 

H2 

Hi 

H, 

Hi 

Ha 

Hi 


Hi 

H2 

Hi 

H, 

Hi 

H, 

4 .a 

2.7 

1.4 

1.3 

3-3 

3-5 

1-3 

1.9 

2.2 

3.1 

4.0 

4.1 

5.0 

3.0 

2.2 

2.6 

X 7 

z 

11 

12 

8 

10 

12 

—6 

—3 

6 

IS 

16 

as 

5 

—3 

X 


In carrying out the analysis of variance, for convenience 2.5 was subtracted 
from each figure, and the results multiplied by 10 to remove the decimal point, 
this giving the figures in the last row. The effect of this transformation is that 
all the sums of squares in the analysis of variance are 10* = 100 times too large. 

We proceed (^) as follows : — 

(1) We form the four tables obtained by summing over each of the four variables 
in turn. Thus the table obtained by summing over S is as below : 


Wi 

W, 

I 

Ji 

B. 

I 


B, 

Hi 


Hi 

H, 

Hi 

H, 

Hi 

H. 

14 

8 

4 

4 

33 

15 

—15 

—5 


The table obtained by summing over W is as below : 


Si 


S 

2 


I 

Ji 

I 

5* 

Bi 

B. 

Hi 

H, 

Hi 

Hi 

Hi 

H, 

Hi 

Hi 

25 

12 

—23 

—18 

22 

11 

12 

17 


The other two three-way tables, obtained by summing over B and over H, 
are similar. 

(^) For an alternative method of calculating the analysis of variance of a experiment 
see page 138. 
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(2) We form the six tables obtained by sumniing over the variables two at a time. 
Thus the tables obtained by summing over S and H, and over S and B, are as 
below : 


Summed over H and S 



Bx 


Totals 

W, 

22 

8 

30 

W, 

48 

—20 

28 

Totals 

70 

—12 

58 


Summed over S and B 



Hi 

H, 

Totals 


18 

12 

30 

w. 

18 

10 

28 

Totals 

36 

22 

58 

i 


The other two-way tables are as below : 
Summed over H and W 


Summed over B and W 



Bx 

B. 

Totals 

Sx 

37 

—41 

—4 

Sa 

33 

29 

62 

Totals 

70 

—12 

58 



Hx 

H, 

Totals 

Sx 

2 

—6 

—4 

S, 

34 

28 

62 

Totals 

36 

22 

58 


Summed over B and H Summed over S and W 



Wx 

Wx 

Totals 

Sx 

—4 

0 

—4 

s. 

34 

28 

62 

Totals 

30 

28 

58 



Hx 

Hx 

Totals 

Bx 

47 

23 

70 

Bx 

—11 

—1 

—12 

Totals 

36 

22 

58 


(3) We take the square of the grand total and divide it by the grand total number 
of observations : — 

58V16 = 210.25 

We shall use this quantity a great deal, and refer to it as “the correcting 
factor,” as it corrects all our sums of squares for the fact that they are deviations 
from the zero, not the mean. 

(4) To form the Sum of Squares for the main effect of the factor S, we square 
the S totals, sum these squares, and divide by the number of original individuals 
forming each S total, and subtract from this the correcting factor, i.e. 

[(— + 62^] /8 — 210.25 = 272.25 

The other main effects are obtained similarly. Thus — 

For W : (30^ + 282)/8 — 210.25 = 0.25 
For B : (TO^ + (— 12)2)/8 — 210.25 = 420.25 
For H : (36^ + 222)/8 — 210.25 = 12.25 

(5) The first order interactions measure the extent to which the effect of one 
factor depends upon the value of the other factor. Thus there are three first 
order interactions involving S, namely S X W, S X B, and S X H. 

The Sum of Squares corresponding to the first mentioned, S X W, is obtained 
by squaring the individuals in the two way table for S and W, summing these 
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squares, and dividing by the number of original individuals making up the indi- 
viduals in this table, and from this subtracting the correcting factor, the S Sum 
of Squares, and the W Sum of Squares, i.e. 

4 - 02 + 342 + 28 2 )] /4 — 210.25 -- 272.25 --- 0.25 = 6.25 
Similarly, the B X H interaction sum of squares is given by 
(472 4 232 4 (—11)2 4 __ 210.25 — 420.25 -- 12.25 = 72.25 

The other first order interactions are derived likewise. 

(6) It will be apparent that there is a possibility that the effect of, say, the B X H 
interaction may depend upon the value of W or of S. The former is termed the 
S X B X H interaction and the latter the W X B X H interaction. Interactions 
of this type are known as second order interactions. 

The sums of squares for these terms are derived from the three way tables. 
Thus to derive the Sum of Squares for the S X B X H interaction we first square 
the individuals in the S X B X H table, sum these squares, and divide by the 
number of original individuals making up each individual in the table, and sub- 
tracting the correcting factor, i.e. 

(252 4 122 4 (— 23)2 4 (—18)2 4 222 4 IP 4122 4172)72 — 210.25 
= 1119.75 

From this figure we subtract the sums of squares for all the main effects and all 
the first order interactions which do not involve the odd variable. Thus with 
the iL- X B X H sum of squares the odd variable is W, so we subtract from 
1119.75 the sums of squares for the S, B, H, S X B, B X H, and H X S terms, i.e. 

1119J5 _ 272.25 — 420.25 — 12.25 — 342.25 — 72.25 — 0.25 = 0.25 
The other second order interactions are obtained similarly. 

(7) The Total Sum of Squares is the difference between the sum of the squares 
of the original individuals and the correcting factor, i.e. 

172 4 22 4 (—11)2 4 ... 4 (— 3)2 4 p _ 210.25 = 1877.75 

( 8 ) The Sum of Squares corresponding to the Residual is obtained by subtracting 
the four main effects, the six first ordei interactions, and the four second order 
interactions from the Total Sum of Squares. 

We thus have a complete analysis as in Table 11.9 below. The degrees of 
freedom are of the following form (small letters represent the number of levels 
at which the factor denoted by the capital letter occurs). 

Table 11,9 


Source of Variance 

Sums of Squares 

B 

420.25 

H 

12:25 

S 

272.25 

w 

0.25 

B X H 

72.25 

H X S 

0.25 

S X W 

6.25 

W X B 

182.25 

W X H 

I 0.25 

B X S 

! 342.25 

B X H X S 

1 0.25 

H X S X W 

1 156.25 

S X Vv X B 

272.25 

W X B X H 

30.25 

Residual 

110.25 

Total 

1877.75 
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(a) for the main effects, e.g. for B, (b — 1) 

(b) for the first order interactions, e.g. for H X B, (h — 1) (b — 1) 

(c) for the second order interactions, e.g. for H X B X W, (h — 1) 

(b-l)(w-l) 

(d) for the residual, (h — 1) (b — 1) (w — 1) s — 1) 

(e) for the total, (h b w s — 1). 

In the present case, the degrees of freedom are in every case 1 except for 
the total. Accordingly, the column for Degrees of Freedom is omitted. Also, 
since the Sums of Squares thus equal their respective Mean Squares the latter 
column is omitted. 

Analysing the various terms it is clear that the W X B X H and B X H X S 
interactions do not exist, so we can pool them with the residual giving a new 
residual of 140.75/3 = 46.92 with 3 degrees of freedom. The H X S X W 
interaction is not significant, and we get a new residual of 297.00 /'’4 ~ 74.25 with 
4 degrees of freedom. Tested against this the S X W X B interaction does not 
reach the 5% level of significance, but it is suspiciously large, and it is desirable 
to split the experiment up into two three factor experiments to check that the 
factors are not interacting. 

With one second order interaction significant, say the S X W X B interaction, 
there is a choice of three methods of splitting, viz. H X W X B for each level 
of S, H X S X W for each level of B, or H X S X B for each level of \V. 

Which method of breakdown we employ is largely a matter of choice ; some- 
times it may be informative to employ more than one. Here one particular 
method of breakdown is strongly indicated, namely according to the solvent used 
(factor S). 

Accordingly, we carry out two three factor analyses, one for Si and the other 
for S 2 . The results of such analyses are below in Table 11.10. 

Table 11.10 


Source of 
Variance 

Mean Squares 

Experiment with 
S at Si 

Experiment with 

S at S 2 

B 

760.5 

2.0 

H 

8.0 

4.5 

w 

2.0 

4.5 

B X H 

40,5 

32.0 

H- X W 

72.0 

84.5 

W X B 

4.5 

450.0 

Residual 

12.5 

128.0 

Total 

900.0 

705.5 


t’4 In this respect the present example constitutes a special case. The adaptation of 
these methods of computation to the more general case is quite straightforward. The 
formulae for the degrees of freedom have just been given. With regard to the sums of 
squares, if we have a four factor experiment with factors A, B, C, and D at 2, 3, 4, and *5 
levels respectively, there are two A totals each the total of 3 x 4 x 5 = 60 observations, 
so the sum of the squares of the A totals is divided by 60 before having the correcting factor 
subtracted from it to give the A sum of squares. There are five D totals, each the total 
of 2 X 3 X 4 = 24 observations, so the division of the sum of the squares^ of the D total is 24. 
For the interaction, for example, the B x C table has 3 x 4 = 12 entries, each the total of 
2x5 = 10 observations ; accordingly the sum of the squares of these 12 entries is divided 
by 10 before having subtracted from it the correcting factor and the B and C Sums of Squares 
to give the B x C Sum of Squares. 
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The Degrees of Freedom for each of the terms except the Total are 1 in every 
case, so the Mean Squares equal their respective Sums of Squares, and therefore 
columns for Degrees of f reedom and Sums of Squares are omitted from the above 
table. 

Let us consider the Si experiment first. The W X B interaction is not 
significant, and neither is the B X H interaction. The H X W interaction is also 
found to be non-significant. Clearly the H and W main effects are non-significant. 
Pooling all these non-significant terms we get a residual of 23.2 with 6 degrees 
of freedom. The only term left is the B main effect, giving a variance ratio of 
760.5 /23.2 = 32.8 with degrees of freedom ni = 1, n 2 = 6. This lies near the 
0.1% level of significance. 

Wt can thus conclude that the B factor has an effect upon the quality of the 
product, Bi giving a mean of 3.47 and Bg a mean of 1.47. The significance of 
the other factors W and H has not been established, the means for Hi and Hg 
being 2.55 and 2.35 respectively and for Wi and Wg being 2.4 and 2.5 respectively. 

It thus seems a matter of indifference which level of H and of W we use, 
but it is very important to select the right level of B. 

Now let us consider the S 2 experiment. We have a large residual. The 
B X H and H X W interactions cannot be significant, and pooled give a new 
residual for 81.5 with 3 degrees of freedom. The H main effect does not contain 
amongst its components of variance awb^, which is contained amongst the com- 
ponents of variance of the outstanding interaction W X B : the H term is non- 
significant and can therefore be pooled to give a new residual of 62.25 with 4 
degrees of freedom. Tested against this the W X B interaction gives a variance 
ration of 450.00/62.25 == 7.2 with degrees of freedom ni = 1, n 2 = 4, this being 
close to the 5% level of significance. 

The fact that the W X B interaction is marked for Sj but non-existent for 
Si implies the existence of a W X B X S interaction, which we had suspected 
from the four factor analysis. The non-existence of a B main effect for S 2 as 
compared with its great significance for Si indicates the existence of a S X B 
interaction : the four factor analysis had showed that this exists but actually in 
the more complicated guise of an S X B X W interaction. 

Here we have a three factor analysis with a significant first order interaction, 
W X B. Accordingly we need to break the three factor experiment down, either 
into W X H experiments for Bi and for B 2 or into B X H experiments for Wi 
and for Wg. 

The results of the former are below in Table 11.11. Here again the Degrees of 
Freedom for each term except the total are 1, so that column and the Mean Square 
column are omitted. 


Table 11.11 



Sums of Squares 

Source of Variance 

B at 


Bx 


H 

30.25 

6.25 

W 

182.25 

272.25 

Residual 

210.25 

2.25 

Total 

422.75 

280.75 
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Considering the Bg experiment, the H term is not significant, and can be 
pooled to give a new residual of 8.50 /2 = 4.25 with two degrees of freedom. 
The W term then gives a variance ratio of 272.25 /4.25 = 64.0 for n^ = 1, Uj = 2, 
this being more significant than 5%. 

We can thus conclude that using the solvent S 2 and treatment Bg (boiling) 
it is a matter of indifference which level of H we use, but that it is important 
to select the correct level of W, the averages for Wi and W 2 being 4.05 and 2.40 
respectively. 

Turning now to the Bi experiment, we have a large residual, which suggests 
that the H X W interaction exists. From the fact that it clearly does not exist for 
B = Bg, this suggests that it takes the guise of a B X H X W interaction. This was 
not shown up on the full four factor analysis, but the evidence here is suggestive. 
With such a large residual, the H and W terms will not be significant, but we 
may get an idea of the behaviour of the system by forming an H X W table, 
Table 11.12. 


Table 11.12 



Wi 

Wi 

Hi 

2.2 

5.0 


3.1 

3.0 


This table suggests that with Sg Bi, we should use Wi, and possibly that 
Hi is better. However, these suggestions cannot be made as definite assertions. 

That is as much as can be done with that method of breakdown. Turning 
to the other, the results of the analysis are below in Table 11.13. The degrees of 
freedom are 1 for each term (except the total) so that column and the Mean Square 
column are omitted. 


Table 11.13 


Source of 

Sum of Squares 

Variance 

W at 


Wi 

W, 

H 

25 

64 

B 

196 

256 

Residual 

16 

144 

Total 

237 

464 


Considering the experiment for Wi, clearly the H term is not significant. 
Pooling it we get a new residual of 41 /2 = 20.5 with 2 degrees of freedom. The 
B term then has a variance ratio of 196/20.5 = 9.55 for ni = 1, ng = 2. This 
is considerably more significant than the 20% level but less significant than the 
5%. However, we may regard it provisionally as real. 

We can thus conclude that using solvent Sg and treatment W^ it is a matter 
of indifference which level of H we use but.it is important to use the correct level 
of B, the means for Bj and Bj being 2.65 and 4.05 respectively. 
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Considering the experiment for Wg, we have a large residual, and therefore 
we cannot expect significance for the H and B terms. However we may get an 
idea of the behaviour of the system by inspecting a B X H table, as below. 


S= IV = W2 



Bi 


Hi 

5.0 

2.2 


3.0 

2.6 


This table suggests that with S 2 Wg, we should use Bg, and possibly is 
better. 

Let us now see whether the two methods of breakdown give consistent 
conclusions. The Bi and B 2 method gave that with Bg we should use Wg to get 
a low figure for the impurity, but that with Bj we should use Wi. The Wj and 
W 2 method gave that with Wi we should use Bi, and with W 2 we should use 
Bg. As regards H, the Bi and Bg method did not give significant results, but the 
Wi and W 2 method suggested (qualitatively, not with an exact test of significance) 
that Hi should be used to give low figures for the impurity. The two methods 
of breakdown thus lead to similar conclusions. 

The effect of the W X B interaction for Sg is shown in Table 11.14 below* 

Table 11.14 


S = SziH averaged) 



Bi 

B. 

Wi 

2.65 

4.05 

w. 

4.0 

2.4 


This table shows clearly that to obtain a low figure for the impurity, if we 
use treatment Bi, then we must use Wi, and if we use Bg then we must use Wg. 
There is no *^best” level for either B or for W : the “best’’ levels depend upon 
the levels of the other factor. Also solvent Sg behaves differently in this respect 
to Solvent Si, for which treatment Bg was always best irrespective of the values 
of H or W. The interaction shown above is the B x W interaction, and since 
it operates for Sg but not for Sj we can say that the S x W X B interaction exists. 

It is of interest to note the type of error into which we might have fallen if 
we had carried out this experiment classically rather than factorially. Classically 
we would- have investigated each factor at one level. If we were unlucky we 
would have investigated B at Wg and W at Bi. From the first experiment we 
would have decided that in order to get a low figure for the impurity we should 
use Bg and from the second experiment that we should use Wi. We thus would 
have decided to use Wi Bg, an entirely faulty conclusion. 

This has been a lengthy analysis, and we might recapitulate the conclusions 
reached : — 

(1) Using the solvent Si 

(a) it is a matter of indifference whether we use Hj or Hg 

(b) it is a matter of indifference whether we use Wi or Wg 

(c) it is desirable to use Bg giving a mean figure for the impurity of 1.47 
as compared with ZAi for Bi. 
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(2) Using the solvent Sg 

(a) it is a matter of indifference whether we use Hi or Hg. 

(b) using Bi we should use Wi, when we get a figure of 2.65 for the 
impurity. 

(c) using B 2 we should use Wj, when we get a figure of 2.4 for the impurity . 

These conclusions have all been reached rigorously, with no guess work, and 

give a complete picture of the behaviour of the system. 

It will be noted that this experiment has been complicated in the sense that 
there have been second order interactions significant. It would have been simpler 
if only the main effects, or first order interactions, had been significant. With 
second order interactions existing, however, the operation of the system is com- 
plicated, so it is inevitable that its analysis should be also. 

It will also be noted that with significant interactions the factorial experiment 
loses its superiority in the sense of efficiency over the classical, and in effect reduces 
to a series of classical experiments. The point is that if a factorial design is not 
used, then these interactions cannot be detected.. 

These questions are discussed further in Chapter XII (d), but meantime it 
must be noted that Tables 11.11 and 11.13 must be considered not satisfactory 
owing to the inadequate number of degrees of freedom. When thesee high order 
interactions are significant we require more experiments than were made in this 
instance. 

(f) The Five Factor Analysis 

(i) In the previous section, we had four factors, source of raw material (R), 
supplier of polymer (S), grade of pol 3 rmer (G), and temperature of Processing (T). 
Let us now assume there is a fifth factor, namely, pressure of processing (P). 

The procedure for five factor analysis of variance can be readily generalised 
from the two, three, and four factor analyses discussed earlier : — 

(1) Sum over each of the variables in turn. This will give us five tables, each 
containing four variables, 

(2) Sum over the variables taken two at a time. This will give us ten tables, 
each containing three variables. 

(3) Sum over the variables taken three at a time. This will give us ten tables, 
each containing two variables. 

(4) Derive the marginal totals of the tables (3). This is equivalent to sum- 
ming over the variables taken four at a time. 

(5) Sum over the variables taken five at a time, i.e. take the grand total. 

(6) Sum the squares of all the figures in the tables described m (1) to (5) 
and divide each sum of squares by the number of original individuals 
forming the individuals in each table. Thus, in the table of class (3) 
formed by summing over G, T, and S, i.e. the table for R and P, the 
divisor is gts. 

(7) The main effects are given by subtracting (5) from each of the sums of 
squares in (4) (both having been divided by their appropriate divisors). 

(8) To get say the GTS interaction, we take the table for G, T, and S (i.e. the 
table of class (2) summed over R and P), take its sum of squares divided 
by the appropriate divisor (here rp) and subtract from this the correcting 
factor (5) divided by its divisor (gtsrp) and also subtract the sums of 
squares already derived for G, T, S, S X G, G X T and T X S. 

Studying the components of variance for a four factor analysis in Table 11.8, 
and comparing them with Table 11.2 for three factors and Table 11.1 for two factors, 
we can readily see how for a five factor experiment the components of variance 
would be built up : — 

(a) The highest order interaction, used as a residual, cto® has no coefficient. 
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(b) The interactions of the next highest order have a©* and the component 
corresponding to the interaction, the latter having as its coefficient the 
letters not included in the subscripts of the interaction. Thus the 
G X T X S X R components are 

pCgtsr® “h <Jo® 

(c) The interactions of the next highest order have cto*, the component 
corresponding to the interaction, and all the intermediate interactions 
whose suffices include the interaction suffices. The general rule for 
coefficients applies throughout : the coefficients are those letters which 
are not included in the subscripts. 

Thus the G X T X S components are : — 

xpagta^ + rtj VsP + poW + do* 

(d) The first order interactions are built up according to the same general 
rules. Thus the S X R components are : — 

gtpa^ar -f- gta^psr +* tpa^gsr “j- pg<y®8rt -f- gcr^part -f- t<J*g8rP pcr*gart 

+ 00 * 

(e) The main effects are built up similarly. Thus, the S main effect includes 
Os*, the first order interactions of S, namely, o^ag, o^gr, o^sp and o^at, the 
second order interactions of S, namely o*8gr, o^sgp, o^arp, o2«tp a*atr and 
oVg, the third order interaction of S, namely o^rgst, o^gtrp, o*srPg and 
<j*spgt, and the residual oo*. All these components have their coefficients 
determined by the general rule mentioned earlier : thus os* has as its 
coefficient pr^. 

(ii) A five factor experiment was carried out on a process for purifying a 
■crystalline product, all factors being at two levels. The quantity of the substance 
per cycle (T) was tested, one level actually corresponding to double the concentra- 
tion of the other. The time to dissolve (D) the crystalline substance in the solvent 
was tested, one level being four times the time of the other level. Time was 
relevant because the dissolving was done in hot solution and as the formation of 
impurities is auto-catalytic a long period at a high temperature might have dele- 
terious effects. The third factor W represented two sb'ghtly different liquids 
used in the next stage of the process, in which there were two speeds of stirring 
(S) and two times (G) to precipitate the pure crystal by steam distillation. 

The original data is given in Table 11.15. The decimal point has been 
removed and large figures represent an inferir products. 

Table 11.15 

Original Data on Five Factor Crystal Purification Experiment. 



Wi 

Wx 


Gx 

G, 

Gx 





D, 

Dx 

Dx 

Dx 


Dx 

D. 

T, 

Si 


108 

113 

95 

103 

137 


116 


S. 

116 

119 

106 

114 

122 

122 

128 

148 


Sx 

155 

170 

131 

172 

146 

152 

135 

144 


S. 

133 

wm 

138 

181 

150 

156 

124 

146 
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In the analysis of variance, all factors being at two levels, the degrees of 
freedom for all terms are unity, and hence the Mean Squares are equd to the 
Sums of Squares. In Table 11.16 giving the Analysis of Variance below the fourth- 
order interactions have not been calculated but have been pooled with the Residual 
whose degrees of freedom are thus (1 5) = 6. Strictly speaking, these inter- 

actions should be calculated, but in practice it is difficult to attach any great 
importance to them, and it is the effects earlier in the table which are of practical 
interest. In Table 11,16 the figures 31 and 6 in brackets after the Total and the 
Residual are the numbers of degrees of freedom for the Total and Residual 
respectively. 

Table 11.16 

Analysis of Variance for Five Factor Eocperiment on Crystal Purification^ 


Source of Variance 

Sums of Squares 

S 

91.1 

G 

325.1 

W 

21.1 

T 

8256.1 

D 

1352.0 

S X G 

264.5 

G X W 

12.6 

W X T 

924.5 

T X D 

741.2 

D X S 

171.1 

G X T 

18.0 

VV X D 

1.2 

T X S 

128.1 

D X G 

120.1 

W X S 

112.5 

s X G X W 

21.1 

G X W X T 

406.1 

W X T X D 

1200.4 

T X D X S 

0.4 

D X S X G 

24.6 

S X W X T 

91.1 

S X G X T 

91.1 

T X D X G 

40.5 

G X W X D 

31.9 

D X W X S 

288.0 

Residual (’6) 

661.6 

Total (31) 

15396.0 


It is apparent that all the second order interactions except W X T X D are 
non-significant, but the latter is large. To test W X T X D, we must note that 
its components of variance are included amongst the components of variance of 
W X T, T X D, D X W, W, T, and D, and therefore the latter terms cannot 
be less than W X T X D. Of these latter terms, only the T and W main effects 
are greater than W X T X D. Accordingly we pool the others with W X T X D 
and obtain a new estimate of the latter of 2888.4/5 = 577.7 with 5 degrees of 
freedom. Testing this against the residual of 1656.4/15 = 110.4 with 15 degrees 
of freedom, we get a variance ratio of 5.2 for ni = 5, nj = 15, corresponding to 
a probability of 1%. Since T is the more prominent main effect, it would seem 
most profitable to break down the five factor analysis into two four factor analyses 
for each level of T, as in Table 11.17. 

Examining the terms for Ti, starting from the Residual and working towards 
the top of the table, we find none of them significant. Ti actually corresponds 
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to the use of the dilute solution. Under these conditions, therefore, we are at 
liberty to choose the four factors at whatever levels may be convenient on prac- 
tical grounds secure in the knowledge that this will not have a deleterious effect 
upon the purity of the product. 

Table 11.17 

Analysis of Variance for two four factor experiments in Crystal Purification, 


Source of Variance 

Sums of Squares 

T, 

T, 

W 

333.06 

612.56 

s 

217.56 

1.56 

G 

95.06 

248.06 

D 

45.56 

2047.56 

W X S 

203.07 

0.57 

S X G 

333.07 

22.57 

G X D 

10.57 

150.07 

D X S 

76.57 

95.07 

W X D 

637.57 

564.07 

G X W 

280.57 

138.07 

W X S X G 

45.55 

175.55 

G X S X D 

85.55 

5.08 

S X D X W 

430.55 

10.55 ' 

W X G xD 

27.55 

7.55 

Residua! 

162.58 

76.55 

Total 

2984.44 

4155.44 


Reverting to Table 11.16, it will be noted that the T main effect, when tested 
against W X T X D (all the first order interactions involving T are non-significant) 
is significant at the 5% level, nearly at the 1% level. The mean for T^ is lower 
than Ta, and thus for purity we should choose Tj. However, since the charge 
is only half of that for Tg, the throughput of material through the plant will be 
only half, and it may be necessary therefore to use Tg if at all possible. 

Reverting to Table 11.17, and examining the analysis for Tji, we find W X D 
is at the 5% level of significance, and accordingly separate analyses were made 
for each level of D, as in Table 11.18. 

Table 11.18 


Source of Variance 

Sums of Squares 

D, 

1^2 

G 

392.0 

6.15 

W 

0.5 

1176.15 

S 

60.5 

36.15 

G X W 

40.5 

105.10 

W X s 

8.0 

3.10 

S X G 

24.5 

3.10 

Residual 

242.0 

10.15 

1 Total 

768.0 

1339.90 
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For the Di Series, all terms are non-significant with the possible exception 
of G, the means for Gi and Gg being 146 and 132 respectively. For the Dg 
series, on the other hand, the only prominent term is W, the means for Wi and 
Wg being 173.8 and 149.5 respectively. 

We may recall that using Tj, with any value of D, W, S and G, the mean 
obtained was 118.2, so as far as purity goes we prefer Ti. If we use Tg, however, 
we should prefer Dj, when we will use any value of G, W, and S and get an average 
purity of 139. Should we need to use Dg, then we can use either G and S but 
it is important to use Wg, giving an average of 149.5. 

It is clear that the complicated behaviour of this system could not possibly 
have been followed without the use of a factorial design. Also, the andysis of 
variance was absolutely necessary in order to identify those effects and interactions 
which were significant. It would have been quite impossible to do this by 
qualitative inspection of the results in Table 11.15.(®) 

(g) Incomplete Two-Factor analysis : One Factor with Replication 

In our two factor analysis (Section (c)), we have g grades of material being 
processed at t different temperatures. Suppose, now, that we make t batches all 
at the same temperature from each of the g grades. Under these circumstances, 
the first unit of Grade 1 has no more correspondence with the first unit of Grade 2 
than it has with the second unit of Grade 2, etc. 

The two-factor analysis has thus become one factor (Grades) with t replica- 
tions. This amounts to pooling the degrees of freedom and the sums of squares 
of the T main effect in Table 11.1 with the Residual. This gives us Table 11.19 : 

Table 11.19 


Source of 
Variance 

Degrees of 
Freedom 

Components of 
Variance 

G 

Residual 
(within G’s) 

s — 1 

(t-l)+(t-l)(g-l) 
= g(t-l) 

CTo* 

Total 

gt — 1 



It is this type of analysis which was discussed in detail in Chapter VII 
Section (c). 

(h) Incomplete Three-Factor Analysis : Two Factors with Replication 

(i) In our complete three factor analysis we had s suppliers each supplying 
g grades of polymer which were processed at each of the t temperatures. Let 
us now assume that t batches were processed at the same temperature, i.e, we 
have two factors G and S, with t replications. Considering Table 11.2, we have 
that the G X T and T X S interactions and the T main effect all do not exist, 
The reason for the interactions mentioned disappearing is that, for example, the 
G main effect is by definition the same for all the replications, and any irregularity 
in it is synonymous with error, for which we use the residual. We then obtain 
Table 11.20. 

Consider the problem in which we wish to investigate a factor in a process, 
In which, say, we wish to determine its effect at three levels, Fi, Fg, and F 3 . 

(®) An example of a seven factor experiment on a factory process will be found in R, L. 
Cunningham and J, Ansel Anderson : ‘‘Cereal Chemistry,*’ Vol. XX, page, 482, 1943. 
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Table 11.20 


Soxirce of 
Variance 

Degrees of 

Freedom 

Components of 
Variance 

G 

S 

G X S 
Residual 

(g-i) 

(s-l) 

(g-l)(s-l) 

(g-i)(s-i)(t-i)+(t-iH 
(g-l)(t-l) + (s-l)(t-l) 

= gs(t— 1 ) 

a^o + ta^gs + tsa^g 

<j2o + tCT^gs + tga^s 

a^o ta^gs 

Total 

gst — 1 



Suppose that the material at one stage passes through a particular process which 
we know or suspect may give rise to variability. Then it will be necessary to 
eliminate the errors due to this other factor. 

The experimental design below would satisfy the requirements. A com- 
parison of the means of the Fs is valid since the disturbing effect of the machines 
Ml, M 2 and M 3 is eliminated. The same experiment also gives us a valid com- 
parison of the three machines, Mj, Mj and M 3 . The data from such an experiment 
would be analysed by the analysis of variance by rows and columns, previously 
discussed in Chapter VII (e) and Section (c) of the present chapter. 



This experimental design has been presented as a means of eliminating errors 
due to a particular factor, but of course it is exactly the same as for a two factor 
experiment, i.e. we might be equally interested in both the factors. 

As the design stands we cannot determine whether there is any interaction 
between the terms, for though we can compare, say (Mi — M 2 ) for F = Fi with 
(Ml — M 2 ) for F = F 3 , our comparison is pointless because we have no estimate 
of the error of either term and hence of theii difference. Without the estimate of 
the error we cannot estimate the significance of any apparent difference. 

We woxild get an estimate of error if we had repeated each experiment, when 
the design would be called a row and column or two factor analysis with replication. 
Consider the data in the table below. 



Nx 

N, 

Totals 

Dx 

—12 —5 

27 27 

37 

Dx 

—29 —22 

34 24 

7 

Totals 

—68 

112 

44 
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It refers to a two factor experiment N X D, each factor at two levels, and 
there are two observations for each point. We proceed as follows : — 

(1) Form the total of each cell, square these totals, sum these squares, and 
divide by the number of original individuals per cell, i.e. 

[(—12 —5)2 + (—29 —22)2 ^ (27 + liy + (34 + 24)2]/2 = 4585 

(2) Form the total for each level of D, square these totals, sum these squares, 
and divide by the number of individuals per total being squared, i.e. 

(372 + 72)/4 == 354.5 

(3) Similarly for each level of N, i.e. 

[(- 68)2 + 1122]/4 = 4292 

(4) Form the grand totd, square it and divide by the grand number of 
individuals, i.e. 

442/8 = 242, 

(5) Square every individual and sum these squares, i.e. 

(—12)2 4. 4 (27)2 4 . _ 4 242 = 4684 

The table of the analysis of variance is as below : 


Source of Variance 

Sums of Squares 

Degrees of 
Freedom 

Mean 

Squares 

Components of Variance 

Between Rows (D) 

Between Colximns (N) 

Row X Column 
interaction (D X N) 
Residual 

(2) — (4) »= 112.S 

(3) — (4) = 4050*0 

180.5 

(S) — (i) = 99*0 

nj — I » I 

n2 — 1 ** I 

(nj — j)ino — i) 

= I 

niUjCng — i) 

=» 4 

112.S 

4050.0 

180.5 

24-75 

njDaOi® + + <i| 

+ <l| 
njOu* + <J| 

^*0 

Total 

(s) — (4) “ 4442-0 

on 




ni rows, n2 columns, replications. 

ai2 = variance due to differences between rows. 

02 2 == variance due to differences between columns. 
ai2* = row X column interaction, 
aj — residual error. 

The Sum of Squares for the Row x Column Interaction is obtained by 
subtracting the total of the Sums of Squares for the other three components from 
the Total Sum of Squares, (5) — (4). 

In general, in testing the mean squares for significance, the row mean square 
and the column mean square are tested against the row x colunm interaction, 
for if they are significantly greater than the interaction, this can only be so if di* 
and a2® respectively are greater than zero. Similarly the existence of the interaction 
can be checked by comparing the interaction mean square with the residual. 

Here we test the Interaction against the Residual, getting a variance ratio 
of 180.5/24.75 = 7.3 for degrees of freedom ni = 1, nj = 4, this lying close to 
the 5% level of significance. 

Accordingly we should conclude that D and N interact, and we should insp^ 
the data broken down into one factor experiments. We can compare Di with 
Dj at each level of N, using the Student t test to compare the two means for 
D. We find that the difference between Di and Da is not significant for N* 
but is quite considerable for Ni. Similarly we can compare the difference between 
Ni and N2 for each level of D, when we find that the difference between the two 
levels of N is greater for Da than for Dj. 

It will be noted that this two factor analysis with replication is really a thr^ 
factor analysis with all the terms involving the third factor R, i.e. the R main 
effect and the R X N and R X D interactions, pooled with the residual. 
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(i) Doubly Incomplete Three Factor Analysis : One Factor with 
Double Order Replication 

(i) In the previous example, we had s suppliers each supplying g different 
grades of polymer, and from these t batches were processed, all at the same tem- 
perature. Now suppose that the s suppliers, instead of each supplying g different 
grades of polymer, supplied g nominally identical consignments of polymer, all 
of the same grade. The system then has the structure given by the sketch below : 

Supplier 1 Sup|>lier 2 Supplier 3 


Consignment 1 


V T y 


r T 2'. 


r 

Batch 1 


Each consignment is made up of a number of batches ; these are only indicated 
for consignment 1 of supplier 1, but should be understood for all the consignments. 
Since the consignments are nominally of similar grade, there is no point in com- 
paring the mean of all the Is, 1, T, etc. with the mean of all the 2s, 2, 2', 2" 
etc. The different consignments 1, 2, 3, etc. from Supplier 1 may differ among 
themselves, however, and similarly for the other suppliers. We can use the 
consistency of the batches processed from each consignment as a measure of 
error against which the difference between consignments can be compared. It 
may be noted that shouid the first batch in all instances be processed at tem- 
perature tj, the second batch in all instances at another temperature tg, etc., then 
we have an incomplete two factor analysis as in the previous section, the factors 
being supplier (S) and temperature (R). 

The variance can then be analysed into the components given in Table 11.21. 
Table 11.20 is derived from Table 11.21 by pooling the sums of squares and degrees 
of freedom of the G and G X S terms. The component of variance corxesponding 
to the G effect in Table 11.21 is more of the nature of an interaction than a main 
effect, but we will symbolically use G (rather than g) as a sufEx to the component 
of variance to denote its rather different character. 


Table 11.21 


Source of 
Variance 

Degrees of 

Freedom 

Components of 
Variance 

Suppliers (S) 

(s-l) 

+ tao^ + tgcr*s 

Between Consign- 
ments (G) 

(g-l) + (g-l)(s-l) 

= s(g— 1) 

CTo + taG® 

Within consignments 
(Residual) 

gs(t — 1) 

ffo* 

Total 

gst — 1 



(ii) A certain type of plant for concentrating dilute acid was subject to failure 
through corrosion. The figures below in Table 11.22 represent the quantity of acid 
obtained from a number of units before failure. From the mass of data 7 units 
were selected from each of 9 factories, and the 9 factories fell naturally into three 
groups according to the type of weak acid they were concentrating. 

To analyse this data we form the following terms : — 

(1) Square every individual and sum all the squares, i.e. 40^ + 49^ + 25* 
+ + 122* + 113* = 339,900 

100 




Table 11.22 


Type of 
Weak Acid 

Factory 

Throughputs 

Totals 

A 

1 

40 

49 

25 

30 

32 

25 

22 

223 * 


2 

40 

46 

41 

30 

50 

39 

31 

277 


3 

38 

38 

32 

33 

42 

45 

44 

272 

B 

4 

16 

50 

40 

54 

80 

64 

59 

363 


5 

49 

35 

70 

64 

23 

70 

88 

399 . 


6 

68 

20 

66 

36 

34 

45 

40 

309 

c 

7 

90 

90 

104 

62 

78 

88 

93 

605 


8 

84 

93 

73 

80 

210 137 

89 

766 


9 

152 

70 

122 

143 

72 122 

113 

794 


(2) Obtain the total for each block (factory), square these totals, sum these 
squares, and divide by the number of individuals in each block, i.e. (223 ^ + 
2772 + . . . + 7942)/7 = 310,016 

(3) Obtain the total for each super-block (type of weak acid), square these 
totals, sum these squares and divide by the number of individuals in each super- 
block, i.e. [(223 + 277 + 272)^ + (363 + 399 + 309)® -H (605 + 766 + 794)2|/21 
= 306,202 

(4) Obtain the grand total for all individuals, square it, and divide by the 
grand number of individuals, i.e. (223 + 277 + 272 + 363 + 

+ 794)2/63 = 254,985 

From these tenns we form the analysis of variance below in Table 11.23. 


Table 11.23 


Source of 
Variance 

Degrees of 
Freedom 

Sums of 
Squares 

Mean 

Squares 

Components of 
Variance 

Between super- 
blocks (type of 
weak acid) 

ni — 1=2 

(3) -(4) 

= 51,217 

25,609 

nj n, 55+ nj 05+05 

Between blocks 
(Factories) 
within super- 
blocks 

(^2 
= 6 

(2) -(3) 

= 3,814 

636 

_ ’ r 2 

^5 c; + 05 

Residual : 
Within blocks 
(within 
factories) 

nin2(n3 — 1) 

= 54 1 

(l)-( 2 ) 

= 29,884 

553 


Total 

n3^n2n3 1 

= 62 

(l)-(4) 

= 84,915 
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Hi — number of super-blocks 

Hg == number of blocks within a super-block. 

Us = number of individuals within a block. 

= variance due to differences between super-blocks. 

== variance due to differences between blocks within super-blocks. 

a® = variance due to differences between individuals within blocks. 

From the last column, “Components of Variance,” we see that can be greater 
than zero only if nga® + is significantly greater than Gg. Thus we compare 
the Between Blocks Mean Square with the Within Blocks Mean Square. Similarly, 
in general we test the Super-blocks Mean Square against the Between Blocks 
(within Super-blocks) Mean Square, for if it is significantly greater than the 

latter this can only be so because is greater than zero. 

Here we see that the Between Blocks Mean Square is not significantly greater 
than the Residual, so we can assume that = 0 and pool the two sets of Sums 
of Squares and Degrees of Freedom to get a new Residual of 33,698/60 = 561.6 
with 60 degrees of freedom. 

Testing the Between Super-blocks Mean Square against this residual we 
have a variance ratio of 25,609/561.6 = 45.6 for degrees of freedom ni = 2, 
n 2 = 60. For these degrees of freedom and the 0.1% level of significance the 
variance ratio needs to be not less than 7.8. It follows that the Between Super- 
blocks Mean Square is very much more significant than the 0.1% level. Solving 
the equations 3x7 = 25,609, ag* = 562, (we have shown ar 2 ® = 0) we 

obtain the value 1192 for gj*. We thus get 

= 1192 

Ga^ = 0 

Og® = 562 

The value of this analysis is that it shows that differences between factories 
within an acid group are not detectable, and that the larger part of the variability 
in throughput achieved is due to differences between acid groups. 

The use of the first point is that it enables us to pool data for all factories within 
an acid group for any detailed comparison: e.g., for comparisons between 
foimdries, in some instances there are 10 pots of foundry A and 1 pot of foundry 
B at one factory, 1 pot of foundry A and 10 pots from foundry C at a second 
factory, and so on. So long as we are confined to comparisons within a factory, 
our estimates of foundry differences will be very inaccurate. Having proved that 
we can pool the data for all factories within an acid group, our comparisons will 
be much more useful. 

The use of the second point is that it throws great emphasis upon the 
importance of the type of acid being concentrated. It can only be followed up 
using orthodox chemical considerations, of course. 

(j) Incomplete Four-Factor Analysis : Three Factors with Replication 

(i) In our complete four-factor analysis we had s suppliers working from 
each of r sources of raw material and providing g grades of polymer which were 
worked at each of t temperatures. If, instead, we work t batches all at the same 
temperature, we then have a three factor analysis with replication, the three 
factors being grade of polymer (G), supplier (S) and source of raw material (R). 

We derive the analysis of variance for this situation from the complete four 
factor analysis given in Table 11,8. We have the T main effect, the first order 
interactions involving T and the second order interactions involving T, all non- 
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existent. We can therefore pool their degrees of freedom and sums of squares 
with the residual, and we obtain Table 11.24 ; — 


Table 11.24 


Source of 
Variance 

Degrees of 

Freedom 

Components of 
Variance 

G 

(g-i) 

OTo^ 4“ tc^g 4“ tsa^rg 4* 
rta®g8 4- tsr<j*g 

S 

(s-l) 

ao* 4- to Vg + gtc*sr 4- 

rto V 4“ gtro®8 

R 

(r-1) 

0*0 4" tO*8rg 4“ gtO*sr 4" 
tSO*rg 4- gtSO*r 

G X S 

(g-l)(s-l) 

0*0 4” to*srg 4” rto*gs 

S X R 

(s-l) (r-1) 

Oo* 4“ to*8rg + gta*sr 

R X G 

(r-l)(g-l) 

Oo* 4- to*srg + tSO*ig 

o 

X 

CO 

X 

(g — 1) (s — 1) (r — 1) 

0*0 4" to*srg 

Residual 

(g-l)(t-l)(r-l)(s-l) 

+ (r-l)(g-l)(t-l) 

+ (t-l)(s-l)(r-l) 
4.(g-l)(t-l)(s_l) 

+ (t-l)(r-l) + (t-l)(3-l) 
+ (g-l)(t-l) + (t-l) 

Oo* 


= gsr(t— 1) 


Total 

gsrt— 1 



(ii) Suppose we carry out a three factor experiment, and replicate each point. 
This could be regarded as a four factor experiment, the fourth factor being the 
order of replication and when the complete analysis of variance is obtained pooling 
with the residual all factors involving A, viz., if N, D, and R are the tnree main 
factors, pooling A, A X N, A X D, A X R, A X N X D, A X D X R, and 
A X R X N. 

The arithmetic is lighter, however, if we total over all levels of A, i.e. sum 
over all replications, for each value of N, D, and R, thus getting a three factor 
table in which each figure is the sum of the replications. 

Consider the data in the table below : — 



Nx 

Na 


I 


D, 

nn 

sjHIIII 

Da 


Rx 

Ra 

Ri 

R* 


R> 

Ri 

R, 

Ai 

—3 

—1 

—8 

—6 


8 

10 

7 

A. 

M 

—6 

—3 

—5 


1 

0 

s 

S(A) 


—7 

—11 

—11 



10 

H 


The row S(A) is the sum of Ai and A^ for each value of N, D, and R. The 
data in this row is subjected to the same three factor analysis previously described, 
except for the following difference : 
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(1) Whenever we sum a set of squares, we divide by the number of replica- 
tions, here 2, in addition to the other divisors. 

Thus one of our three two-way tables is 



T>i 

D, 

Totals 

Ni 

—17 

—22 

—39 

N, 

11 

14 

25 

Totals 

—6 

—8 

—14 


The correcting factor is ( — 14)2/8 X 2 = 12.25 

The sum of squares for N is 

[■(_39)2 (25)2]/4 X 2 — 12.25 = 256.00 

The sum of squares for D is 

|(_6)2 + (-~82]/4 X 2 — 12.25 = 0.25 

The N X D interaction is given by 

r(_i 7)2 (—22)2 _|^ ( 11)2 (142)]/2 x 2 — 12.25 — 256.00 — 0.25 

= 4.00 

The Sums of Squares of the R, R X D, and R X N effects are obtained 
similarly. 

(2) An independent estimate of the Residual is obtained as the difference 
between the sum of the squares of all the original individuals and (the sum of 
the squares of the values for S(A) divided by the number of replications, here 2)y 

(—3)2 + (—1)2 + . . . (—3)2 — ((-10)2 + (_7)2 + . _ 42)/2 ^ 256 

(3) The Total Sum of Squares is given by the difference between the sum of 
the squares of the original individuals and correcting factor, i.e. 

(_3)2 _|_ (_i)2 + . . . + (—3)2 — 12.25 = 539.75 

(4) The N X D X R Interaction Sum of Squares is obtained as the difference 
between the Total Sum of Squares and the sum of all the other terms. 

The results of this analysis are as below : 


Source of 
Variance 

Sums of 
Squares 

Degrees of Freedom 

Mean 

Squares 

N 

256.00 

n — 1 = 1 

256.00 

D 

0.25 

d— 1 = 1 

0.25 

R 

1.00 

r— 1 = 1 

1.00 

N X D 

4.00 

(n-1) (d-l)= 1 

4.00 

D X R 

16.00 

(d_l)(r-l)=l 

16.00 

R X N 

0.25 

(r-l)(n-l)= 1 

0.25 

N X D X R 

6.25 

(a_l)(d-l) (r-l) = l 

6.25 

Residual 

256.00 

n d r (.a — 1) — 8 

32.00 

Total 

539.75 

r n d a — 1 = 15 



The N X D X R interaction is tested against the Residual, and is obviously 
not significant. In fact, all the terms except the N main effect are not significant 
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and can be pooled to give a new residual of 283.75/14 = 20.2 ; the N main effect 
then gives a variance ratio of 256.00/20.2 = 12.7 with degrees of freedom ni == 1, 
na = 14, this being more significant than the 1% level of significance. We can 
then calculate the means for Ni and Ng as — 4.875 and 3.125 respectively. 

It may have been in this example that the second replication (series Rg) may 
be at a different general level from the first (series Rj), due to an adjustment of 
the plant between the two series, a new batch of raw material, etc. The variance 
due to such an effect will be included in the Residual, and if large will give us an 
inflated estimate of the Residual, thus lowering the accuracy of our experiment. 

Accordingly we may take out from the Residual the Sum of Squares corres- 
ponding to the replication by forming the total for series Ri = 15 and for series 
Rg == ( — 29) and then calculating 

[152 + (— 29)2]/8 — 12.25 = 121.00 

Thus the Sum of Squares for the Residual is now 256.00 — 121.00 = 135.00 and 
its Degrees of Freedom are 8 — 1 = 7 giving a Mean Square of 19.29. 

In this case, therefore, we have gained considerably by taking out the series 
Sum of Squares, and we have increased the likelihood of detecting significance 
in the Terms very considerably. 


(k) Doubly Incomplete Four Factor Analysis : Two Factors with 
Double Order Replication 

(i) Let us now suppose that in the previous example the suppliers, instead 
of working to product g different grades, actually produced g nominally identical 
grades. We still process t batches from each of these also at the same temperature 
as before. 

The variance can be analysed now into between source of raw material (R), 
between suppliers (S), and the R X S interaction, and the two further components 
shown below in Table 1 1.25. The degrees of freedom and sums of squares for the 
new G term are obtained by pooling them together with the G X S, S X R and 
G X S X R terms. The Residual is the same as in Table 11.24. 

Table 11.25 


Source of Variance 

Degrees of Freedom 

Components of Variance 

R 

S 

R X S 

Between G 

Within R and S 

Within G (Residual) 

(r— I) 

(s — I 

(r — i)(»— i) 

(g — i) + (« — i) (r — i) + (g — i) (s — i) 

+ (g — i) (r — i) (s — i) = IS (g — i) 
gsr (t — i) 

+ *®*G + 

+ “G 
a2 

0 

Total 

gsrt — I 



(ii) The purpose of this experiment was to determine whether the position of a 
scroll of nitrated paper in a stabilising vat had any effect upon its nitrogen content. 
Two scrolls (S) were stabilised in each of two positions (P) and samples were 
taken from the top and bottom of each scroll (F). Duplicate determinations of 
the nitrogen content were performed on each of these eight samples. 
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At first sight this would appear to fall in the class of three factors (P x S x F) 
with replication, the class which we discussed in the previous section. 


Position 
of Scroll 

Scroll 

Number 

Fraction 
of Scroll 

% Nitrogen 

Pr 

Si 

Fi 

12.20, 

12.20 



F, 

12.26, 

12.26 


Sa 

Fi 

12.10, 

12.10 



Fa 

12.39, 

12.38 

Pa 

S'l 

Fi 

12.31, 

12.30 



Fa 

12.44, 

12.44 


S', 

F, 

12.23, 

12.21 



Fa 

12.37, 

12.35 


However, a moment's reflection will show that there is no true one-to-one 
correspondence between Si and S'l, or between Sa and S' a. Sj has no more 
association with S'l than with S'a. It is thus evident that the factor S is really 
a form of replication. 

The second point is that the duplication of analysis does not constitute a 
genuine replication, i.e. a genuine repeat of the whole experiment (to perform 
the latter we should have to use different scrolls in a different vat). 

To carry out the analysis of variance we might carry out the full analysis, 
^d then pool together the degrees of freedom and sums of squares on the lines 
indicated previously. However, it is less trouble to evaluate the relevant terms 
directly. In the calculation below the zero used is 12.20 and the scale has been 
multiplied by 100. 

(1) We need the square of the grand total divided by the number of individuals 
in that total (the usual correcting factor) : 

134 V16 = 1122.25 

(2) We need the two-way table for P and F. 



Fi 

Fa 


Pi 

—20 

49 

29 

Pa 

25 

80 

105 


5 

129 

134 


From this table the P, F, and P x F sums of squares are obtained in the 
usual manner : — 

P = (292 + 1052)/8 — 1122.25 = 361.00 
F = (52 + 1292)/8 — 1122.25 == 961.00 
P X F = (20=‘ + 49‘‘ + 25=“+80V4 — 1122.25 — 361.00 — 961.00= 12.25 

(3) The sum of squares between analyses is 

(02 + 02 + 6 * + . . . + 15 ^) — [(0 + 0 )“ + (6 4 - 6 )» + . . .]/2 = 5.00 

(4) The total sum bf squares is 

(02 + 02 + 62 — + 152)— 1122.25 ^ 1731,75 
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(5) The sum of squares for Error is obtained by difference. 
The resulting table is below : 


Sources of Variance 

Sums of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Between position in vat (P) 

361.00 

1 

361.00 

Between fraction of scroll (F) 

961.00 

1 

961.00 

F X P 

12.25 

1 

12.25 

Error 

392.50 

4 

98.12 

Between analyses 

5.00 

8 

0.62 

Total 

1731.75 

15 



The F X P interaction, tested against the Error is clearly non-significant ; 
we can pool it with the Error. The P main effect, the principal object of the 
experiment, can then be tested against this Error. The F main effect is not of 
direct interest, but can also be tested against the Error. It was advisable to have 
included F in the experiment, however, as it was quite possible that P would 
interact with it. 


(1) Trebly Incomplete Four Factor Analysis : One Factor with Triple 
Order Replication 

(i) In the previous example, the Factor R represented source of raw material. 
Suppose, on the contrary, that it represented operator in the factory. Then 
obviously, there is no point in comparing the first operator in all the factories 
with the second operator in all the factories, and we are only interested in this 
respect in the variation between the operators within a factory. 

The sums of squares and degrees of freedom for the new R term are given 
by pooling them for R and R X S in Table 11.25, thus giving us Table 11.26. 


Table 11.26 


Source of 
Variance 

Degrees of 

Freedom 

Components of 
Variance 

S 

(s-l) 

Co^+ ta^G + gta®R-(- gtra^S 

Between R within S 

(r-l) + (r-l)(s-l) 

= s(r— 1) 

ao^ + ta^G + gta®R 

Between G within R 

rs(g-l) 

Co^ + ta^G 

Within G (Residual) 

gsr (t — 1) 


Total 

gsr (t — 1) 



The system has the structure indicated below, where there are similar 
assemblies for subsequent super-blocks. 

Super-block 1 


Block 1 


Block 2 


Sub-block 1 


1 


I 

I2 


I 

S ub-block 2 

L I 


Sub-block 3 


L 


I, 


Sub-block 4 
1 I 

I. L 


The number into which the larger units are split can of course be anything from 
2 upwards. 
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(ii) Consider the data in the table below. We have an allegedly homogeneous 
batch of material contained in vats Vi to V^. The contents of each vat are wrung 
in a centrifuge and bagged. We select two bags Bi and Ba from each vatload 
and take two samples Si and Sg from each bag. On each sample we perform 
an analysis for percentage of an ingredient, in duplicate, Ai and Aa. 



Vi 

V2 

V3 

V4 


Ve 



^2 

Bx 

B2 

Bi 

B2 

Bi 


Bi 

Bs 

Bi 

B2 



S2 

Si 

S2 

Sx 

S2 

Si 

S2 

Si 

S2 

Si 

S2 

Si 

S2 

Si 

Si! 

Si 

S2 

Si 

S2 

Si 

S2 

Si 

S2 

:: 

39 

29 

28 

27 

39 

29 

u 

29 

29 

27 

28 

26 

27 

24 

25 

33 

30 

29 

30 

25 

27 

30 

31 

29 

39 

30 

31 

28 

28 

3 ° 

28 

30 

29 

27 

27 

25 

28 

26 

26 

29 

31 

31 

32 

29 

30 

29 

31 


In carrying out the analysis of variance it lightens the arithmetic to take an 
arbitrary zero, e.g. by subtracting 20 from every observation. We form terms 

(1) Square every individual and sum the squares, i.e. 

92 + 82 + 92 4- 72 4- . . . + 102 + IP = 3636 

(2) Obtain the total for each sub-block, square the sub-block totals, sum 
these squares, and divide by the number of individuals per sub-block, i.e. 

[(9 + 9)2 4 - (8 4 - 7)2 4 - (9 + 9)2 + . . . + (9 + 10)2 + (9 + uyy2 
= 3616, 

(3) Obtain the totals for each block, square the block totals, sum these squares, 
and divide by the number of individuals per block, i.e. 

[(9 4 - 9 + 8 + 7)2 + (9 + 7 4- 9 4 - 8)2 + . . . 4 . (9 + 10 + 9 + ll)2]/4 
= 3570.5 

(4) Obtain the totals for each super-block, square the super-block totals, 
sum these squares, and divide by the number of individuals in each 
super-block, i.e. 

(662 + 552 4 - 742 -f 822)/8 = 3534.25 

(5) Obtain the grand total for all individuals, square it, and divide by the 
grand number of individuals, i.e. 

4082/48 == 3468 

From these terms we form the analysis of variance in Table 11.27 below. 

Table 11.27 


Source of Variance 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Squares 

Components of Variance 

Between Super-blocks 
(vats) 

Between Blocks within 
Super-Blocks 
mags within vats) 
Between Sub-blocks 
within Blocks (Samples j 
within Bags) 

Between Individuals 
within Sub-blocks 
(^alyses within 
Samples) 

ni — - 1 

niCng — i) = 6 

n^njCng ~ i) 

= 12 

ningUgCn^ — i) 

= 24 

( 4 )-(s) 
= 66,25 

( 3 ) — ( 4 ) 
*= 36.2s 

(2) — (3) 

= 45-50 

(1)— (2) 
»= 20.00 

13.2s 

6.04 

3.79 

0.833 

n2n3n4Ci2 + ngn^CTa^ + ^4^^B + <^4® 

4- 

^4^ 

Total 

= 47 

(I) - ( 5 ) 
= 168.00 




n^ = number of super-blocks = 6 
n 2 = number of blocks within a super-block = 2 
ng == number of sub-blocks within a block == 2 
n^ = number of individuals within a sub-block = 2 

^ 4 ^ are respectively the variances due to the four terms listed 
under the heading ** Source of Variance.” 
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Proceeding to the analysis of Table 11.27, the Between Samples with Bags 
Mean Square, tested against the Between Analyses within Sample Mean Square 
(to test for the existence of — see the Components of Variance), gives a 
variance ratio of 3.79/0.833 = 4.6 for degrees of freedom ni = 12, ng = 24: 
this is more significant than the 0 , 1 % level of significance. ^ 

To test for the existence of we test the Between Bags within Vats Mean 
Square against the Between Samples within Bags Mean Square. The variance 
ratio is 6.04/3.79 = 1.6 for degrees of freedom ni = 6, Ug = 12 . This is less 
significant than the 20% level, so we can take it that org* does not exist. We 
pool the two terms to get a new estimate of of 81.75/18 = 4.53 

with 18 degrees of freedom. 

To test for the existence of Ci®, we test the Between Vats Mean Square 
against the Between Samples within Bags Mean Square. We have shown that 
does not exist, so we are testing (n 2 n 3 n 4 <Ji 2 + n 4 C 732 + against (n 4 a 3 * + 
C 42 ). The variance ratio is 13.25/4.53 = 2.93 with degrees of freedom = 5, 
Hg = 18. This is more significant than the 5% level of significance. 

We can now calculate as (13.25 — 4.53)/8 = 1.09 and as (4.53 — 0.83) 
/2 = 1.85, 

Our final conclusion thus is that the variability can be accounted for by the 
terms 

= 1.09 

Gg® === 0.00 

G 32 = 1.85 
G 42 = 0.83 

We see that there is an appreciable variability between vats, the variability between 
bags from a given vat is not large enough to be detected, the variability between 
samples within a given bag is quite large, in fact the most important component, 
and the variability between duplicate analyses on a given sample is quite appre- 
ciable. The total variance of a single analysis is thus (1.09 + 0.00 + 1.85 + 0.83) 
= 3.77. 

•It is further clear from this analysis of variance that the allegedly homogeneous 
batch is significantly inhomogeneous, and that the larger part of this inhomo- 
genity arises in bagging the material = 1.85). Each vat is apparently 
homogeneous = 0 . 00 ), but there is an appreciable difference between vats 
(gi* = 1.09). 

(m) An Incomplete Five Factor Analysis 

When we come to five factor analyses the number of different types of incom- 
plete analysis is considerable. We will discuss only onetype(^), but from such a 
viewpoint that it is hoped that the general method of treatment will be clear. 

The table below gives the data of an experiment on a process using a cellulose 
derivative. The final product was subject to fluctuations in a certain property, 
and it was required to find out at what stages the variability was arising. The 
cellulose derivative reaches the process in large homogeneous blends, jmd^ from 
these a number of batches are mixed. Normally the whole of each mix is put 
into a single truck and then dried, after which it is possible to observe the quality 
of the product. Under this procedure it is impossible to disentangle the effects 
of mixing from those of drying : to do this requires that a truck should contain 
parts of more than one mix (in order to compare mixes independently of their 
drying) and that a mix should be dried in more than one truck (in order to compare 

(^) A complicated example dealing with glass manufacmre is described by C. E. Gonld 
and W. M. Hampton : Supplement to Journal of the Royal Statistical Society, VoL 111, 
page 137, 1936. See particularly E. S. Pearson’s contribution to the discussion page 156 
This example is also discussed by Tippett, “Methods of Statistics, Section 10, 31, and 
by Kendall, “The Advanced Theory of Statistics,” Vol. II, page 202, where he treats it 
as a complete analysis. 
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dryings independently of the material of the mix). For practical reasons it was 
impossible to split a mix into more than two portions. Accordingly a mix was 
made and split into two portions, one portion going into one truck and the other 
into another truck. The next mix was also split and the two portions filled up 
the previous two trucks. A second pair of mixes taken from the same blend of 
the cellulose derivative were similarly tested. The whole was repeated on three 
subsequent blends. On every individual (that is, material from a particular truck 
and mix from a particular pair from a particular blend) a duplicate determination 
of quality was made. 


Blend 

Pair 

Mix 







Mix 

Pftir 


Numbers 

Numbers 

Numbers 







Totals 

Totals 

Total! 

I 

I 

I 

11 

i 6 

27 

17 

16 

33 

60 





2 

o 

o 

o 

14 

13 

27 

27 







27 


60 


87 



2 

3 

12 

12 

?4 

10 

8 

18 

42 





4 

6 

6 

12 

1 

3 

4 

16 







36 


22 


S8 

14s 

2 

3 

5 

9 

9 

18 

19 

rS 

37 

55 





6 

3 

I 

4 

6 

6 

12 

16 







22 


49 


71 



4 

7 

8 

8 

6 

ooo 

M M 


13 

7 

27 

13 

44 

»7 







31 


40 


71 

Z42 

3 

S 

9 

IS 

17 


22 

21 

43 

75 





lo 

21 

20 

41 

13 

13 

26 

67 







73 


69 


142 



6 

zz 

z8 

i8 

36 

19 

21 

40 

76 





Z 2 

6 

6 

12 

6 

4 

10 

22 







48 


50 


98 

240 

4 

7 

13 

ZI 

ZO 

21 

3 

2 

S 

26 






5 

6 

11 

I 

0 

1 

12 







32 


6 


38 



8 


8 

8 

z6 

$ 

6 

1 % 

29 





ZO 

4 

4 

8 

7 

8 

H 

*3 







24 


27 


SI 

89 


In the table the first two figures 11 and 16 are the duplicate determiuatiopg 
of quality on Mi? 1 Pair 1 Blend 1, first truck. The 27 is the total of 11 and 16. 
The next pair of figures 17 and 16 are the determinations on the second truck of 
mix 1, and the 33 following is their total. The last figure in the row is the mix 
total. The second row is for the second mix and is precisely identical. The 
third row has 27 as the total for the first truck, 60 as the total for the second truck, 
and 87 ^ the pair total. The same follows for the second pair for blend 1 with 
the addition of 145 as the blend total. ’ 


Prima facie this is a five factor analysis, the factors being Blends (B), Pairs fP). 
Mixes (M), Truck? (T) and Determination of Quality (A). It Is immediately 
apparent that it js not a complete five factor analysis, however, as A is obviously 
a simple replication : there is no distinction between all the first determinations 
as compared 'mth the second. Further, there is nothing special to distinguish 
the first pairs from the second pairs, so the pair main effect P will not exist and 
we will have a Between Pairs within Blends effect formed by pooling the sums 
of squares and degrees of freedom of the P and B X P terms. 

We will proceed to calculate this term and the B main effect. The correcting 
factor, the grand total over the grand number of individuals, is 
616764=5929 
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The Blend main effect is of course given by the blend totals 
(145® + 142® -f 240® + 89®)/16 - 5929 == 740.37 
The Between Pairs within Blends term is given directly by squaring the pair 
totals and dividing by the number of individuals therein, and subtracting the 
correcting factor and the Blend sum of squares, i.e. 

(87® + 58® + . . . + 51®)/8 — 5929 — 740.37 = 184.13 
We would of course get the same result if we formed a B X P table, calculated 
the B, P, and B X P (residual) and then pooled P with B X P. 

We now come to consider the M effect. There can be no M main effect, 
as the first mix in all pairs are not systematically distinguished from the second 
mixes : we therefore have that am® = 0. Further the M effect is only of interest 
within a given pair, which is within a given blend : this term is ombp®, and amb® 
and amp® are both zero. If we strike out am®, amb® and amp® from the components 
of variance of the M, M X B, and M X P mean squares, we find we have in each 
case left as the components of variance 
aa® + aa®mtbp + ata®mbp 

which is the same as for the M X B X P mean square. We therefore pool these 
four terms, M, M X B, M X P, and M X P X B, and call the resulting term 
‘‘Between Mixes within Pairs and Blends.’* The pooled degrees of freedom are 
(m — 1) + (m — 1) (b — 1) + (m — 1) (p — 1) + (m -- 1) (p — 1) (n — 1) 
= bp(m — 1). 

It is clear that this is a reasonable answer : looking at it in a rather different way, 
each pair of mixes contribute (m — 1) degrees of freedom, and there are bp pairs, 
so the total is bp(m — 1). 

To calculate this pooled M X B X P term we need a M X B X P table, i.e. 
a table formed by totalling over A and T. Such a table is already formed in the 
column headed “Mix Totals,” 60, 27, 4, 16, 55, etc. The total sum of squares 
for this table is 

(60® + 27® + 4® + . . . + 23®)/4~ 5929 = 1771.5 ‘ 

To obtain our pooled M X B X P term we could calculate in full M, M X B, 
BxP, MxBxP arid then pool them. We may note, however, that the 
Total of an M x B X P table is made up of these terms plus B, P, and B X P. 
These latter we already have, so the pooled M X B X P term is very easily obtained 
as 

1771.5 — 740.37 — 184.13 = 847.00 
Its degrees of freedom are 

bp(m — 1) = 4 X 2(2 — 1) = 8 

The T term is obtained in exactly the same wav as the M term. We require 
the term formed by pooling T, T X B, T X P, and T x B X P, so we work from 
the T X B X P table formed by totalling over A and M. The total sum of 
squares for this table is 

(27* + 60® + 36® + 22® + 22® + . . . + 27*)/4 — 5929.0 = 1274.5 
From this we subtract the sums of squares of B, P, and B X P as before, and 
we obtain as the sum of squares for the B X P X T term 
1274.5—740.37 — 184.13 = 350.00 
The degrees of freedom are 

bp(t— 1) = 4X2(2— 1) = 8. 

We now want the M X T interaction. As for M and T, this only exists 
within Pairs and Blends, so we pool M X T with MxTxB, MxTxP 
and 'M X T X B X P. We therefore work from the M X T X B X P tabl^ 
i.e. the table formed by totalling over A. The total sum of squares for this table is 
(27® + 60® + 0® + 27® + 24* + . . . + 15*)/2 — 5929.0 = 2316.00 
To obtain the sum of squares for our pooled M X T X B X P term we can 
subtract from this total all the other terms arising inanMxTxBxP table. 

Ill 



These sire 

(a) B and B X P : these we have as 740.37 + 184.13. 

(b) M, M X B, M X P and M X B X P : this we have calculated as 847.00 

(c) T, T X B, T X P, and T X B X P : this we have calculated as 350.00. 
The sum of squares is thus 

2316.00 — 740.37 — 184.13 — 847.00 — 350.00 = 194.50 

Its degrees of freedom are 

(ni-l)(t^l) + (m-^l) (t^l)(b-l) + (m^l)(t-l)(p-l) 

+ (m-l)(t~l)(p~l)(b--l) 

= bp(m— l)(t— 1) 

= 4x 2(2 — 1)(2~1) 

= 8 . 

The final term is the Between Analyses which is 

(IP + 16* 4- 17* + 16* + 0* + . . , 8*) — (27* + 33* + 0* + . . . + 15*)/2 

= 8278 — 8245 = 33.00 

The total sum of squares is the usual 

(IP + 16* + . . . + 8») — 5929 = 2349.00 

We can now enter all these terms into a table of analysis of variances. 



Mean Squares of 



Mean 

Squares 


Source of 
Variance 


Sums of 
Squares 

Degrees 
of Freedom 

Components of Variance 





246.79 

s a a 

Between 

Blenda 

B 

740.37 

3 

,+ "mtbp + tbp + 

•“mbp + *“*®bp + “'*P<»b 


Between Pairs 
within Blenda 

P 

P X B 

184.13 

4 

46.03 

".j+^rntbp+J™^ tbp + 
"®p>bp + "“»bp , 

BetweenTrucks 
within Pairs 

T 

T X B 

350.00 

8 

43-75 

+ »®mtbp + “^tbp 

and Blends 

T X P 

T X B X P 



105.87 

a a a 

Between Mizes 
within Pairs 

M 

M X B 

847.00 

8 

+ "mtbp + “'"mbp 

and Blends 

M X P 

M X B X P 




1 1 

Mix X Truck 

M X T 

194-50 

8 

24-31 

mtbp 

Interaction 

M X T X B 


within Pairs 

M X T X P 





and Blends 

Between 

M X T X B X P 

33.00 

32 

1.03 

0 * 

Analyses 


a 

Total 


2349.00 

63 . 


4 


The appropriate tests of significance can be worked out from the corhponents 
of variance. It may be noted, however, that if both the Between Mixes and 
Between Trucks terms are significant when tested against the Mix X Truck 
interaction, i.e. if both crj,bp and are greater than zero, then there is no 
valid test for the Between Pairs Mean Square This circumstance is similar to 
that arising in Table 21 (Chapter XI (d)). 

The Between Trucks Mean Square is not significant, but it is probably worth 
while calculating its component of variance. We obtain 
aa* = 1.03 
a*,, = 11.64 
= 15.40 
*cjt* = 4.86 

(Tp* = 0.00 

<Tb* == 10.05 

It is clear that the origins of variability are rather widespread, Mixing and 
its interaction with Trucks (drying) and Blends being responsible for the greater 
part of the variability. 
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CHAPTER XII 

MISCELLANEOUS ASPECTS OF THE ANALYSIS OF VARIANCE 


(a) Introduction 

In this Chapter a number of miscellaneous aspects of the use of the analysis 
of variance will be discussed. The next five sections contain matter which might 
be borne in mind in planning and analysing the results of factorial experiments, 
Section (g) is an account of a useful experimental device (the Latin Square), and 
the subsequent sections describe various applications of the analysis of variance. 

(b) The Use of C^ mponents of Variance(^) 

In the systematic discussion of the Analysis of Variance in Chapter XI for 
each type of analysis the components of variance corresponding to each mean 
square were given. 

It should be realized that the mean square only estimates these terms, and 
one must not regard them as precise algebraic identities. Thus in the complete 
three factor analysis (Table 22) we had 
31.08 estimates wa*ta + cto* 

161.08 estimates ao® 

It is evident that au* does not exist, i.e. is zero, and so we have two estimates of 
(Jo* which we can pool in the usual way, taking into account the degrees of freedom 
(i.e. the simplest way is to pool the sums of squares and the degrees of freedom 
and calculate the new mean- squares, estimating ao* as the former divided by the 
latter). 

This pooling is of course only valid when the estimates being pooled are 
consistent as tested with the variance ratio test. If '(w(j*ta + (Jo*) was signi- 
ficantly greater than cjo* then of course we would not pool. It is possible that 
(wota* + ao®) may be significantly smaller than ao*. This is very unlikely to happen 
and if it does then the meaning is rather obscure. Either it is an accident (the 
1 in 20 chance which will occur 1 in 20 times) and we can ignore it, or some human 
agency is entering in a certain way to bias the results. 

The Components of Variance are useful for indicating the sequence of tests 
in the testing of the various terms of significance. They are also useful for esti- 
mating the actual amount of variability each part of the process is giving rise to, 
in order to decide on which part of the process the control must be improved to 
increase the uniformity of the product. This application was discussed in Chapter 
VII (Section (c)) and Chapter XI (Section (m)). 

The coefficients of the components of variance given in the various tables 
in Chapter XI are strictly only valid when the supposed populations are infinite. 
If the population is made up of batches of raw material, we can comfortably 
suppose that there is an infinite population of batches even if we are only con- 
cerned with a few dozen or a few hundred. If the populations are factories, then 
there may be only two or three factories and unlikely to be any others built in 
the future. Under these circumstances it becomes a little difficult to regard the 
population as infinite, and strictly the coefficients of the components of variance 
become modified. (*) 

For practical purposes the modifications can be generally ignored, because 

(a) the sequence of tests of significance is unaltered, and therefore no mistakes 
will be made by using the unmodified coefficients. 

(b) the actual estimates of the component of variance using the simple 
coefficients should be multiplied by the factor (k—- l)/(k where k is the size of 
the population). It is only for k = 2 or 3 that the factor is appreciable, being 

(^) On this point see S. L. Crump : Biometrics Bulletin, Vol. 2, No. 1, page 7, 1946. 
(*) cf. H. E. Daniels, Supp. Joum. Roy. Stat, Soc. VI No. 2, 1939, p. 188). 
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0.5 and 0.66 respectively. For larger values of k the difference of the factor 
from unity is rarely important compared with the fairly large standard error that 
is of course attached to these components of variance. 

In the case where a factor is made up of arbitrarily chosen different levels 
of a continuous variable, the population can of course be reasonably regarded as 
infinite, but we are probably more interested in the actual mean values and it 
will probably be of no interest to calculate the cohaponent of variance. 

(c) Partitioning a Sum of Squares into Linear, Quadratic, etc. 

Components 

Frequently we have a factor at 3, 4, 5 or more levels, where the factor has a 
numerical nature, the levels being chosen at intervals on the range of the continuous 
variable, e.g. temperature. We may wish to know whether the relation between 
the independent variable and the dependent variable can be regarded as linear, 
or whether there is appreciable curvature. If the steps between successive levels 
have been- made equal, e.g. if the temperatures are 20, 25, 30, 35®C., then the 
question can be answered with the greatest of ease. 

Consider the data in Table 12.1. The four values of T represent four 
temperatures, increasing in equal steps, of nitration of cellulose in mixed acid of 
four different water contents, represented by W, this also varying in equal steps. 
The dependent variable whose values are given in the table is the viscosity of .the 
resulting nitro-cellulose. 

Table 12.1 



Tx 

T. 

Ts 

T. 

Totals 

w, • 

6 

32 

45 

63 

146 

W, 

24 

27 

45 

62 

158 

W, 

6 

24 

44 

45 

119 

W4 

1 

22 

23 

39 

85 

Totals 

37 

105 

157 

209 

508 


A straightforward two factor analysis of variance gives the results in Table 12.2* 


Table 12.2 


Source of 
Variance 

Degrees of 
Freedom 

Sums of 
Squares 

Mean 

Squares 

Temperature 

3 

4052.0 

1350.67 

Water Content 

3 

787.5 

262.50 

Residual 

9 

367.5 

40.83 

Total 

15 

5207.0 



Both the effects are obviously highly significant. We can proceed to partition 
each of the two sums of squares into three components, one degree of freedom 
for the linear term, one for the quadratic term, and one for the cubic term. 

For the linear term, the sum of squares is 
(3 T, + T 3 — T 2 — 3 Ti)^/NS k^ 

The T’s represent the totals of N observations, and Sk^ in the sum of the 
squares of the co-efEcients of the numerator. Here 
Sk2 32 + 12 + (-^1)2 _|_ (_3)2 ^ 20 
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Accordingly the linear component of the sum of squares is 
(3 X 37 + 105 — 157 — 3 x 209)“/4 X 20 = 4032.80 
The quadratic component of the sum of squares is 
(T4 — T3 — T2 + Ti)^''NSk“ 

= (37 _ 105 — 157 + 209)74 x 4 = 16.00 
The cubic component is 

(T* — 3 T, + 3 T 3 — Ti)/NSk* 

= (37 _ 3 X 105 + 3 X 157 — 209)74 X 20 = 3.20 
A ready check on the correctness of the arithmetic is obtained by noticing 
that the sum of the three components equals the total given in Table 12.2. 

The Sum of Squares for Water Content can be partitioned exactly similarly 
and we get the results in Table 12.3. 


Table 12.3 


Source of Variance 

Degrees of 
Freedom 

Sums of 
Squares 

Mean 

Squares 

Temperature : Linear 

1 

4032.80 

4032.80 

Quadratic 

1 

16.00 

16.00 

Cubic 

1 

3.20 

3.20 

Water Content : Linear 

1 

616.06 

616.06 

Quadratic 

1 

132.25 

132.25 

Cubic 

1 

39.20 

39.20 

Residual 

9 

367.50 

40.83 

Total 

15 

5207.00 



The linear components in each case are obviously significant, and the signi- 
ficance of the quadratic and cubic terms can be tested in the usual manner a^inst 
the Residual. It is interesting to note that the apparent bend over from W* to 
Wi for decreasing W is not significant. 

In the case where there are only three levels, there are 2 degrees of freedom, 
one for the linear component and one for the quadratic component. These sums 
of squares are respectively 

(Ti — Ts® /NSk*, Sk^! = 2 

(2 Tj — Ti — T3)V'NSkS Sk = 6 

For more than four levels, Fisher and Yates. “Statistical Tables for Biological, 
Agricultural and Medical Research” (Oliver and Boyd) give the appropnate factors 
up to 52 levels. 

In the case where two factors are numerical variables changing in eqiwl 
steps, their interaction can be split up in an analogous manner. Consider the 
3x3 table below (Table 12.4). 


Table 12.4 



X, 

Xa 

X 3 

Totals 

Yi 

17 

20 

24 

61 

Y, 

14 

19 

25 

■ 53 

Ya 

10 

18 

30 

58 

Totals 

41 

57 

79 

177 
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We can use the coefficients 

(Ti — T3)2/NSk2 

and (— T^ + 2T, ^ T,)/Ni:k^ 

to partition the two main effects into their linear and quadratic components, which 
we can tepresent as Xl, Xq, Yl and Yq respectively. We can also partition 
the 4 degrees of freedom for the interaction into that attributable to Xl Yl, 
Xq Yl, Xl Yq, and Xq Yq. To do this we set up a table of the type shown 
in Table 12.5. 


Table 12.5 


Xl 

Xq 

1 

-1 

0 

2 

—1 

--1 

Yl Yq 





1 —1 

0 2 

—1 —1 


To form a table of coefficients for obtaining Xl Yl, we take the coefficients 
for Xl and multiply them by the first coefficients for Yl to give the first row, 
by the second coefficient for Yl to give the second row, and similarly for the 
third row. For Yl we multiply the coefficients for Xq by those for Yl in a 
similar manner. 

Proceeding thus, we obtain the table of coefficients in Table 12.6, 


Table 12.6 


XlYl 

XqYl 

XlYq 

XqYq 

1 0 —1 

0 0 0 

—1 0 1 

—1 2 —1 

0 0 0 

1 —2 1 

—1 0 1 

2 0—2 

—1 0 1 

1 —2 1 
—2 4 —2 

1 —2 1 


Sk^ = 4 2k2 = 12 Sk2 = 12 Sk" = 36 

The Sk® quoted below each table is the sum of the squares of the coefficients. 
To obtain the sum of squares due to e.g. Xq Yl, we operate on the figures in Table 
12.4 by the coefficients for Xq Yl in Table 12.6, square this result, and divide 
by 2k ^ times the number of observations in each figure that has been multiplied 
by the coefficients. In the present instance the latter figure is of course 1, but 
if the figures in Table 12.4 were the totals of N observations then it would be N. 
Accordingly we have as the Sum of Squares for Xq Yl 

(—1 X 17 + 2 X 20 — 1 X 24 + 1 X 10 — 2 X 18 + 1 X 30)V12 
= 0.750 ^ ^ 


We can ^ now assemble the complete analysis of variance as in Table 12.7. 
The coluirm Sums of Squares*^ is obtained by squaring the figures in column two 
and dividing by column three. A good check on the arithmetic is given by the 
agreement of the sum of all the components with the sum calculated directly from 
the original observations. 
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Table 12.7 


Source of 
Variance 

Products of 
Coefficients 
and Data 

NSk® 

Sums of 
Squares 

Xl 

—38 

3x2 

240.667 

Xq 

— 6 

3x6 

2.000 

Yl 

— 3 

3x2 

1.500 

Yq 

— 3 

3x6 

0.5Q0 

Xl Yl 

13 

1x4 

42.250 

Xq Yl 

3 

1 X 12 

0.750 

Xl Yq 

5 

1 X 12 

2.083 

Xq Yq 

3 

1 X 30 

0.250 

Total 



290.000 


If we had not made this detailed analysis, but merely an analysis by rows 
and columns, we would have obtained Table 12.8. 

Table 12.8 


Source of 
Variance 

Degrees of 
Freedom 

Sums of 
Squares 

Mean 

Squares 

X 

2 

242.667 

121.333 

Y 

2 

2.000 

1.000 

Residual 

4 

45.333 

11.333 

Total 

8 

290.000 



This analysis, though better than nothing, is not nearly so informative as 
that of Table 12.7. Table 12.8 is completely unable to give any information on 
the possible X Y interaction. With Table 12.7 we of course have to pool the 
snaaller terms Xq, Yq, Xl Yq, Xq Yl, and against this Xl Yl is found to be 
significant. 

This device is applicable generally for any number of levels. From a table 
of the type of Table 12.5 we can form tables of the type of Table 12.6, and this 
procedure will work for mixed levels, e.g. a3 x4ora3 x5ora4x5. Further, 
there can be other factors present, and this is taken account of by adjusting N in 
the divisor NSk* to correspond to the number of original observations that have 
been summed to give us the table, as Table 12.4, on which we operate with the 
coefficients. 

In the case of four or more levels for each factor, the number of components 
rises rapidly, e.g. for a 4 X 4 experiment the residual has 9 d^ees of freedom 
and hence 9 components, for a 5 X 5 experiment the corresponding number is 16. 
The calculation of all of them is time-consuming and also really , unnecessary. 
Normally the only components that are likely to be of interest are Xl, Yl, Xq 
and Xl,Yq. Even the latter two are unlikely to be of interest if Xq and Yq 
respectively have proved to be small. Accordingly it normally suffices to calculate 
the total interaction in the usual manner and obtain a residual as the difference 
between it and the sum of the three components mentioned. In this case, of 
course, there is no explicit check on the arithmetic, so the steps taken need careful 
checking. 
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It might be emphasised again that this whole procedure rests on the assump- 
tion that each factor is increasing in equal steps, for example, 10®C, 15°C, 20®C, 
25°C. However, if we prefer, we can make the steps equal on a logarithmic 
scale, for example 1%, 2%, 4%, 8%. In this case, Xl will measure the fit to 
linearity on this scale, and similarly Xq and the higher terms deviations from 
linearity on this scale. 

(d) The Assumption Underlying Factorial Design 

A factorial experiment is much more useful, in that its interpretation is 
simplest and that its efficiency in the sense of givmg more information for a given 
number of experiments than the corresponding classical design is greatest, when 
the interactions are not significant. 

It may be interesting to note that the non-existence of interactions presupposes 
that the dependent variable y can be expressed as the sum of a series of functions 
of the independent variables Xi, Xg, etc., each of the latter involving one and only 
one independent variable, i.e. 

y = f(xi) + f(x 2 ) + (1) 

The functions can be an 3 rthing, of course, with no limit to the degree of complexity. 

To take a simple example, let 

y = Xl + 2x4 ( 2 ) 

If the two independent variables were at two levels both equal to 1 and 2, we 
would obtain the results in the table below for y : 



X4= 1 

Xa = 2 

Xi= 1 

2 

5 

Xl = 2 

3 

6 


It will be readily seen that there is no Xi X x* interaction. Thus the effect 
in y of changing Xa for Xi = l is 5 — 2 = 3 which is identical with that for 
Xl = 2, 6 — 3 = 3, i.e. the effect on y of changing Xj is independent of the level 
of Xj. 

Now consider a relationship of the type 

y = f(xi) f(xs) (3) . 

where the functions f(xi) and ffxa) can be of any form. 

To take an example of the simplest form, let 

y = Xl x\ (4) 

With the two independent variables at two levels 1 and 2, we could obtain the 
results as in the table below for y. 



Xa = 1 

Xj = 2 

Xi= 1 

1 

4 

Xi = 2 

2 

8 


It will be readily seen that there is ailx^^ x^ interac tionj Thus the effect 
in y of changing Xg from 1 to 2 for Xi = i is equal tb”4 1 = 3 but for Xi = 2 
is equal to 8 — 2=6, i.e, the effect on y of changing Xj is dependent on the level 
of Xl. 

It will be apparent that if we take logarithms throughout, equation (3) becomes 
log y = log f(xi) + log f(Xi) (5) 

and we now have an equation of the same form as (1), and the analysis will be 
non-interacting. 
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(e) The Use of Interactions as Estimates of Error 

In our original two factor (row and column) analysis what was termed the 
Residual included not only variance due to error but also to any possible interaction 
between the factors. If the latter exists and is large, then we have a large Mean 
Square for the Residual, and hence neither of the main effects is likely to be 
significant, i.e. we conclude the experiment knowing little more about the behaviour 
of the system than when we began. 

There are two possible courses of action open under these circumstances. 

We can replicate, and carry out an analysis as described in the section on 
Two Factor Analysis with Replication, which would give us an estimate of error 
independent of the Interaction. 

. Alternatively, we could repeat the series of observations, but with a third 
factor, held constant at a certain level in the first series, held constant at another 
level in the second series. The experiment would now be a three factor one, 
and we would be relying upon the second order interaction (P X Q X R) to be 
small or non-existent so as not to inflate the Residual. 

It may happen, however, that this second order interaction is large. A good 
example of this was in the Section on a Four Factor Experiment, when the four 
factor experiment was broken down into two three factor experiments, and the 
S2 had a large Residual almost certainly because it contained a large second order 
interaction. 

Under these circumstances there are two possible alternatives. We can either 
replicate, in which case we would analyse the data as in the Section on Three 
Factor Analysis with Replication. 

Alternatively, we could add a fourth factor, and use the third order interaction 
as an estimate of error. It may happen that this third order interaction is large, 
and so we will have an inflated estimate of error, and still not be much better off. 
However, if this interaction is large, it is valuable to know this, so it will have 
been as well to include this additional factor. 

A further example is in the Five Factor experiment discussed in Chapter 
XI (f), where the W X T X D interaction was very large. If the experiment 
had been carried out as a three factor one on W, T, and D, this large W X T X D 
interaction would have been used as the residual, and no conclusions could have 
been drawn. Actually, of course, the experiment included two further factors, 
S and G, and fortunately neither of these interacted, so a reasonably small residual 
was obtained. 

We can thus formulate some general ideas about the planning of factorial 
experiments. We have first to decide what are the main factors in which we are 
interested, and the number of levels at which we will test them. We then need 
to decide which are the factors which normally vary in the process and are likely 
to interact with our main factors, and include these, possibly only at two levels. 
We now need to decide whether the combination of factors so far introduced will 
give us a satisfactory residual as given by some non-existent high order interaction. 
To make this decision is of course frequently difficult. With a certain amount 
of prior information as to the general behaviour of the system we can frequently 
have some confidence in our conclusion, but often it is quite impossible to guess. 
We can only go on the general rule that it is not very frequent for high order 
interactions to exist, or if they do exist, to be very large, and be prepared to replicate 
the whole experiment if we end up with a large residual. Alternatively, if it is 
absolutely essential to be certain of getting results on the first experiment, then 
we must include a replication in the first experiment, perhaps at the cost of omitting 
a factor. 

It is probably generally ^esirable to include batch of raw material as one of 
the factors, for it may be that all batches do not react identically to particular 
treatments. In an example in the author’s experience, a crude chemical was 
subjected to a purifying process, and a factorial experiment established with a 
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greater significance than 1% that level 1 of a particular factor gave a much better 
product than level 2. This experiment was only carried out on a single batch of 
crude material, however, and subsequent experience showed that in general this 
conclusion was erroneous. If the experiment had been replicated on one or more 
batches this mistake would have been avoided. If we can afford to carry out 
replications, we should therefore replicate on different batches rather than within 
a single batch, this really being to introduce an additional factor, batch of raw 
material. 

(f ) The Amount of Detail Required in Reports 

In presenting in a paper or report the results of an investigation using a 
factorial design with an analysis of variance upon the results, it is desirable to give 

(a) the factors and rite levels at which they were used, 

(b) the analysis of variance. It is a useful practice to mark with asterisks 
the significant terms, one (*) for the 5% level, two (**) for the 1% level, and 
three (***) for the 0.1% level. 

(c) If we are interested in components of variance then we should calculate 
those which are significant. If we are interested in means then these should be 
given. In the case where two factors interact, a two-way table is necessary, 
e.g. Table 29. 

(d) In general it is not necessary to give the original data, particularly when 
it is extensive ; the statistics specified above contain the essential facts. How- 
ever, when the original data is not very extensive, there is no harm in giving it. 

(g) The Latin Square 

A statistical device that was originally introduced by Fisher to overcome 
difficulties arising out of differences in soil fertility in agricultural experiments is 
the Latin Square. 

Suppose we wished to investigate the effects of two factors, P and Q, each 
at four levels, upon a process. P might be the pressure at wHch cordite was 
pressed and Q might be the angle of the choke plate in the press. Then we 
would carry out the experiment below : i.e. 16 experiments are carried out, one 
for each vdue of Q for each value of P, thus PiQi, PiQ 2 , PiQs, PiQi, PaQi, 
PjQ, and so on. 



To the first order, our comparison of the Qs is independent of our com- 
parison of the Ps (i^oring interactions), and the results of such an experiment 
could be tested for significance by carrying out the analysis of variance for rows 
and columns. 

However, suppose that the material for pressing comes from the previous 
(incorporation) stage in homogeneous batches of such a size that only (say) ten 
pressings could be made from one batch. Then we could not cany out the above 
design, because it involves 16 pressings, and thus part of the series would be 
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carried out on one batch and part on another batch and possible differences 
between batches would be hopelessly mixed up with the P and Q comparisons. 

This difficulty can be avoided by the use of a Latin Square design. Consider 
the arrangement below, where A, B, C and D represent four different batches. 
Thus with batch A, four readings are taken in which for the first P is at P^ and 
Q at Qi, the second P at P^ and Q at Q 4 , etc. Similarly for the other batches. 



Qi 

Qi 

Qs 

Q 4 

Pi 

A 

B 

c 

D 

Pi 

B 

c 

D 

A 

P 3 

C 

D 

A 

B 

P 4 

D 

A 

B 

C 


It will be noticed that each row contains A, B, C and D each once and only 
once. Thus whatever disturbing effects possible differences between the batches 
may introduce occur equally for each row, and hence are completely eliminated 
when we take the average of the rows for our comparison of the Ps. 

Exactly similarly each column contains A, B, C and D each once and only 
once and dl disturbing effects due to differences between the batches are elim- 
inated from the comparison of the Qs. 

The results of such an experiment would be tested for significance with an 
analysis of variance on the foUovring lines : — 

(1) Square every observation and sum these squares. 

(2) Obtain the total for each row, square these totals, sum these squares, 
and divide this total by the number of individuals in each low. 

(3) Similarly for columns. 

(4) Similarly for batches. 

(5) Square the grand total and divide by the grand number of individuals. 

We then form a table of this analysis of variance (Table 12.9). 

Table 12.9 


Source of 
Variance 

Sums of 
Squares 

Degrees of 
Freedom 

Mean 

Squares 

Rows 

(2) -(5) 

n— 1 

• 

Columns 

(3) -(5) 

n— 1 


Batches 

(4) - (5) 

n — 1 


Residual 


(n_l)(n-2) 


Total 

(l)-(5) 

n^-. 1 

j 



n = size of square. 

The Residual Sum of Squares is as usual the difference between the Total Sum 
of Squares and all the other components. 

The various mean squares are tested for significance with Fisher’s variance 
ratio test. Here we are principally interested in the Row and Column effects, 
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corresponding to our factors P and Q. If the effects prove not significant we 
can regard it as there being no significant difference between the P (or the Qs). 
If the effect proves significant, we can calculate the means for each level of P 
(or Q). For making a detailed comparison of any two levels of P (or Q) each 
mean has the standard error given by the square ro6t of the Residual Mean Square 
divided by the square root of n. 

We have discussed the 4x4 Latin Square, but there is in general no restric- 
tion upon square size. Designs are available for sizes up to 12 x 12. 

One of the assumptions underlying the use of a Latin Square is that there 
should be no interaction between the factors. Since the existence of interaction 
in industrial chemical systems appears to be common, the Latin Square may not 
be of wide applicability in this field. 

An alternative method for dealing with the situation of restricted batch size, 
where we need to carry out a given number of experiments for a factorial experi- 
ment and the batch size of raw material is not large enough, is known as con- 
founding (see Chapter XIV). 


(h) Graeco-Latin and Higher Order Squares 

Consider the Latin Square in Table 12.10. It is possible to construct another 
Latin Square, using Greek Letters (Table 12.11) such that when they are super- 
imposed (Table 12.12) to each Latin letter corresponds all Greek letters and 
vice-versa. Such a square is called “Graeco-Latin.” 


Table 12.10 Table 12.11 



Aa 

BP 

Cy 

Da 

Ee 

Ba 

Ce 

Da 

Ep 

Ay 

cp 

Dy 

Ea 

Ae 

Ba 

De 

Ea 

AP 

By 

ca 

Ey 

Aa 

Be 

Ca 

Dp 


The applications of this to experimental design are obvious. If as in the 
previous example different columns were different angles, the rows different 
pressures, and Latin letters different batches, then the Greek letters could be any 
further factor such as rate of pressing. 

The analysis of variance proceeds exactly as in the previous section, except 
that now there is a further item derived exactly in the same way as the first three 
in Table 12.9, again with (n — 1) degrees of freedom. The Sum of Squares for 
the Residual, obtained fiy difference as before, now has (n — 1) (n — 3) degrees 
of freedom. 
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It is possible to superimpose on Table 12.12 a fifth factor which still satisfies 
the appropriate conditions, when the degrees of freedom for the Residual will 
be (n — - 1) (n — 4), or in this case 4. 

Generally speaking, except for n = 6 , on a square of side n we can super- 
impose n factors still leaving (n — 1) degrees of freedom for the residual. These 
designs can thus be extremely useful for rough surveys, as for example in a total 
of 25 runs we can get good estimates of 5 factors each at 5 levels. The disadvan- 
tages, of course, is that they do not estimate interactions : nevertheless, for a 
quick survey of a new field, in which we are principally interested in the main 
effects and propose to examine those which turn out to be of practical importance 
in more detail later, these designs are valuable. 

Fisher and Yates’ Tables (Table XVI) give the structure of such squares 
from n = 3 to 9. It is remarkable that for n = 6 nothing higher than a Latin 
Square is possible. 

(i) The Theory of Chemical Sampling (*) 

(i) The techniques outlined above can be made the basis of a scientific study 
of sampling, a field that has been not adequately dealt with in chemical industry. 

Thus consider the variances derived for the problem in Chapter XI (f). 
Between Vats = = 1.09 

Between Bags within Vats = = 0.00 

Between Samples within Bags = = 1.85 

Between Anal)rses within Samples = < 74 ^ = 0.83 

Suppose we have one analysis on one sample from one bag from one vat. 
This will have a variance of (1.09 + 0.00 + 1.85 + 0.83) == 3.77 or a standard 
deviation of 1.94. It will the 4 have a 95% chance of being less than 1.96 X 1.94 
= 3.80 out from the true mean (1.96 is the value of t for i nfini te degrees of freedom 
for the 5% level of significance). 

If this is too inaccurate an estimate of the quality of the batch, we might 
reduce the error in our estimate by doing duplicate analyses. It would then 
have a variance of (1.09 + 0.00 + 1.85 + 0.83 /2) — 3.35 or a standard deviation 
of 1.83. It is clear that this is very little improvement upon our original estimate 
of 1.94 based on a single analysis. 

We might take a pair of samples from a bag, and perform single analyses 
upon them. Our estimate would thus have a variance of (1.09 + (1.85 4 * 0.83) /2) 
= 2.43, or a standard deviation of 1.56. This is a definite improvement upon our 
earlier estimates. 

Alternatively we might take a pair of samples, one from one bag from one 
vat and the other from another bag from another vat. Our estimate of the mean 
batch quality would now have a variance of (1.09 + 0.00 + 1.85 + 0.83)/2= 1.88 
or a standard deviation of 1 . 37 . 

It is thus clear that it is much less profitable, if we are going to have the 
laboratory doing two analyses, to have them doing duplicate analyses upon the 
same sample or separate analyses from two samples from the same bag than have 
them doing separate analyses upon samples drawn from separate bags from 
separate vats. 

If we are dealing with a chemical substance, samples of which can be blended, 
such as liquids, powders, etc., then a further possibility is open to us. We can 
take a pair of samples, one from one bag from one vat and one from one bag 
from another vat, blend them, and carry out one analysis upon the blended sample. 

(•) Interesting papers on sampling problems are : D. J. Finney : Biometrics BiiUetin, 
Vol. 2, No. 1, 1946, page 1. C. S- Pearce : Journal of the Royal Statistical Society, Vol. 
evil, page 1 17, 1944. F. Yates : Journal of the Royal Statistical Society, Vol. CIX,page 12, 
1946. Chapter 17 of G. W. Snedecor’s “Statistical Methods*’ 4th Edition (Iowa State 
College Press) is also stimulating. 
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Our estimate would now have an error variance of ((1.09 + 0.00 + 1.85)/2 + 
0.83) = 2.30 or a standard deviation of 1.52. Or if we took four samples from four 
vats and blended them and carried out one analysis upon the blended sample 
our estimate would have an error variance of ((1.09 + 0.00 + 1.85)/4 + 0.83) = 
1.56 or a standard deviation of 1.25. Thus with a little extra trouble in sampling, 
and no more work on the part of the laboratory, we have got an estimate of the 
batch mean quality with an error standard deviation of 1.25, to be compared with 
the simple estimate from one sample of 1.94. 

It is on the basis of such an exploratory analysis of variance that we can 
deduce what is the error of our present sampling scheme and if the latter proves 
to have too high an error the analysis of variance will show the most economical 
method of improving its accuracy. 

(ii) Further examples of the application of the analysis of variance to the 
study of chemical sampling are given below. An allegedly homogeneous blend 
had been bagged. To test its homogeneity three analyses were performed on 
three samples taken from three bags from the blend. The material was nitro- 
cellulose, and it was analysed for nitrogen content and viscosity. 


Nitrogen (%) 



1 

2 

3 

Sample 




I 


12.23 

12.19 



12.24 

12.21 


mmm 

12.23 

12.22 


12.21 

12.22 

12.24 

II 

12.20 

12.23 

12.21 


12.19 

12.25 

12.23 


12.21 

12.22 

12.22 

III 

12.22 

12.21 

12.22 


12.20 

12.24 

12.22 


Viscosity (poises) 



1 

B 

3 

Sample\v 





WMM 



I 









128 

134 

114 

II 

127 

114 

127 

1 

132 

127 

119 




||BB 

III 
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The data was analysed as in Chapter XI (i), with results as below : 



Degrees of 
Freedom 

Nitrogen 

Viscosity 

Source of Variance 

Sums of 
Squares 

Mean 

Squares 

Sums of 
Squares 

Mean 

Squares 

Between Bags 

2 

24.51 

12.25 

267.56 

133.78 

Between Samples 
within Bags 

6 

10.23 

1.70 

427.11 

71.19 

Between Analyses 
witl^ Samples 

18 

25.33 

1.40 


34.11 

Total . . 

26 

60.07 


1308.67 



Units for nitrogen : hundredths of a per cent. 

In the case of Nitrogens, the Between Sample Mean Square is not significant 
but the Between Bags is, at the 1% level. In the case of Viscosity, the Between 
Samples Mean Square is not significant but the Between Bags Mean Square is 
nearly significant at the 5% level. 

It is thus apparent that there is appreciable inhomogeneity in the blend 
from bag to bag but not within bags. We can calculate the components of variance 
as below : 


Source of Variance 

Nitrogen 

Viscosity 

Between Bags 

1.20 

10.0 

Between Samples . . 

0.00 

0.0 

Residual 

1.48 

43.4 

Total 

2.68 

53.4 


The figure under ‘^Total’’ is the variance of a single analysis upon a sample 
from one bag. 

To effect improvement in the nitrogen figuxe, we need to concentrate atten- 
tion upon both operating sources of error. It is not sufiicient to take n samples 
and bulk them and perform one analysis on the blended sample : thus if n was 
4, the variance of the estimate thus obtained would be (1.48 + 1.20/4) = 1.78. 
This is an improvement, but no matter how many samples we bulk, so long as 
we perform only one analysis the variance will never be less than 1.48. If we 
performed two analyses upon a blended sample derived from four bags, the 
variance of the mean would be (1.20/4 + 1.48/2) = 1.04. 

With the viscosity determination, it is clear that our main efforts must be 
directed at obtaining more analyses. If we had four analyses upon one sample 
the variance of our estimate of the mean would be (10 + 43.4/4) = 20.8, or four 
analyses upon separate samples gives a variance of (10/4 + 43.4/4)== 13.3. 
The gain by taking separate samples is appreciable, therefore. 

(j) The Homogeneity of Data 

At the end of Chapter III on the comparison of means it was mentioned 
that complications arise in the calculation of standard deviations if the observations 
are not independent of each other. 

In general, with an analysis of variance between and within batches (One 
Factor with Replication) if we have m batches of n individuals then let us define 
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the simple variance given by the Total Mean Square (i.e. the variance we would 
calculate if we ignored the inhomogeneity of the data and regarded it as referring 
to nm independent individuals) as ot*. Then 

242 = 1 ' + 

mn — 1 




where oa* and ors* are the components of variances correspondmg to differences 
between batches and differences within batches. 

Rearranging, this can be expressed as 

mn — 1 

It is evident that ct® is always smaller than ax* = cta^ + dB*, and the error 
will be greatest when aB®/dA* tends to 0 and when m is large and n = 2. Under 
these circumstances at* is only half ax*, the correct value for the total va’riance. 

Consider the block of 30 observations discussed in Chapter VII (b), Analysis 
Between and Within Columns. Suppose we wished to calculate the standard 
deviation of the mean of these observations. 


If we ignored the fact that these 30 observations are not independent of each 
other, but are actually 10 groups, each group being 3 observations, we should 
calculate the variance from its ordinary definition which corresponds to terms 
(1) — (3) in that section divided by the total degrees of freedom, i.e. 203.87/29 
= 7.030, and the standard deviation of the individual observations would be 
2.65, and the standard deviation of the mean would be 2.65/v^30 = 0.484. 

However, this is a false result, for our mean is not the mean of 30 independent 
individuals, but actually the mean of 10 column means. 

The variance of each column mean is (<jb®/3 + oa*) and since there are 10 
column means the variance of the grand mean is (aB*/3 + oa*)/10, i.e. (2.17/3 
+ 5.22)/10 = 0,594. The standard deviation of the grand mean is thus 
Volw = 0.771. 

This true standard error of the grand mean of 0.771 is very much larger 
than the false one of 0.484 we obtained by ignoring the inhomogeneity of the 
data. If we had been using the standard error to compare the mean with some 
other value using the Student t test we would have fallen into serious errors, for 
not only is the true value of the standard error of the grand mean much larger 
than the incorrect one, but also the true degrees of freedom are only 9, not 29. 
This is why it is in general very desirable to plan experiments of this type in a 
form suitable for the application of the analysis of variance. 

Thus, if the thirty units (ten batches) referred to had been made of one 
consignment of raw material, and we wished to compare the grand mean of another 
consignment, it would be most convenient to make ten batches (thirty units) with 
the second consignment. This would then be suitable for an analysis of variance 
as below : 


Source of Variance 

Degrees of 
Freedom 

Sums of 
Squares 

Mean 

Squares 

Between Consignments 
Between Batches within 
Consignments 

Between Units within 
Batches 

(2-1)= 1 
2(10—1) = 18 

2 X 10(3— 1) = 40 



Total 

2 X 10 X 3 — 1 
= 59 




126 




It will be noted that it would not be possible to apply this technique if the 
experiment had been done haphazardly, unplaimed, so that units were missing 
from some of the batches and the numbers of batches made from each consignment 
were unequal 


(k) The Use of Logarithms in the Analysis of Variance 
^ We discussed earlier the possibility of using the logarithm of the dependent 
variable rather than the dependent variable itself in cases where deviations were 
likely to be proportional to the mean, i.e. constant if we use logarithms. 

An example of this is occurring in the case of acid plant throughputs dis- 
cussed in Chapter XI (i) (Table 11.22). The Between Factories Within Acid Group 
term is not significant, so we can form a pooled residual variance with 3 x 6 = 18 
degrees of freedom for each type of weak acid. These become 57.19, 405.21, 
and 1197.84 respectively. These are severely inconsistent when tested with the 
variance ratio test. However, the standard deviations are 7.56, 20.13, and 34.61, 
and when expressed as fractions of the means 26.7, 51.0, and 103.3, we get 0.294* 
0.396 and 0.334 respectively. It seems that these fractions are roughly constant* 
and we are encouraged to tiy an analysis of the logarithms of the data. 

The natural logarithms of one-tenth of the throughput was selected as being 
the most convenient function to handle. Residual variances of 0.04489, 0.22450, 
and 0.08891 were obtained, each with 18 degrees of freedom, for each of the three 
acid types as before. While not strictly consistent, these are much more so than 
the simple variable. The Analysis of Variance on the logarithm of the data is 
in fable 12.13. 


Table 12.13 


Analysis of Variance of Logarithm of Acid Plant Throughputs. 


Source of 
Variance 

Degrees of 
Freedom 

Sums of 
Squares 

Mean 

Squares 

Type of weak acid 

2 

11.4670 

5.7335 

Factories 

6 

0,6769 

0.1128 

Within Factories 

54 

6.4494 

0.1194 

Total 

62 

18.5933 



The conclusions to be reached from the above analysis are identical with 
those obtained from Table 11.23, the analysis of the simple data. 

This is a reassuring aspect of the analysis of variance : we can disr^ard the 
underlying assumptions with comparative impunity. 

(1) Other Transformations in the Analysis of VarianceC*) 

Sometimes in an analysis of variance the dependent variable is not a continuous 
variable but discontinuous. 

For example, in discussing blemishes in objects, the number of blemishes 
of course has the lower limit zero but can go to very large numbers. Technically, 
the distribution is probably Poissonian. Under these circumstances it is more 
appropriate to use the square root of the number of blemishes as the dependent 
variableC*). Where the number of blemishes are in many cases less than 10, it 
is slightly preferable to xise, where x is the number of blemishes, Vx + 1 /2 

{*) A very interesting paper on the effects of various transformations ina3x3x2x2 
experiment is by Thorp et al : Phytopathology XXXI, page 26, 1941. 

(•)M. S. Bartlett: Supplement to the Journal of the Royal Statistical Society, 3 : 68 
(1936), 
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In the case where the number of incidents x being counted has a finite uppei 
limit, being able to range from O to X, it is best to take the angle 0. 

' 0 = sin-^Vx/X 

For example, if x — 40, X = 120, then V x/X = Vl/S = 0.5774, and sin-^ 
0.5774 = 35.2®. 

In practice convenient tables of the transformation are available. (®) 


(m) Missing Values(’) 

Occasionally nms fail to produce results : the sample may get lost or any 
similar accident may happen. In the case of a two factor experiment, of r rows 
and c columns, one missing value may be efficiently estimated by the formula {®) 
^ _ rR + cC — T 
(r-l)(c~l) 
where r = number of rows 

c = number of columns 
R = total for the row with the missing item 
. C = total for the colunm with the missing item 
T = grand total 

Thus, in Table 12.1, if the value of T 2 W 2 (actually the observed value was 27) 
had been missing, we would have estimated it as 


^ _ 4x 131 + 4x78-^481 , 

(4— 1)(4-- 1) 

In carrying out the analysis of variance, this value is inserted as though it 
had actually been observed, and the analysis then proceeds normally. However, 
the degrees of freedom for the Residual and for the Total are both decreased by 
unity. 

It is wiser to regard conclusions from an experiment with a missing observa* 
tion with rather more caution than if it had been complete. That is to say, we- 
had better only attach something of the order of a 5% significance to what 
apparently is a 1% significance. 

When two observations are missing, they can be inserted by an iterative 
method. We insert for Xj an assumed value, say the grand mean, and then 
calculate the expected value of Xg. Using this value of Xg, we delete the assumed 
value for Xi, and then calculate its expected value, say X'l. Using this value of 
X'l we delete the previous value Xg and recalculate its new value X'g. Using this, 
new value X^g we delete X'l and recalculate a new value for Xj, Xi". After a 
very few cycles of this process the new values will be the same as the old. The 
analysis of variance then proceeds with two degrees of freedom subtracted from 
both the Residual and the Total. 

It is obvious that an inserted value can never be as good as an observed one, 
and so every effort should be made to avoid having to handle incomplete data. 
Where the lack of balance is on a large scale, the simplest case can be handled 
rigorously with little increase in trouble as in Chapter VII (d). Snedecor(*) (page 
281-301) discusses some more complicated instances.- However, in practice we 
can frequently obtain a moderately satisfactory result by discarding sufficient data 
randomly from the larger classes so that we have only one or two missing values 
which can be inserted by the methods above. 


(*) Fisher and Yates “ Statistical Tables for Biological Agricultuie, and Medical, 
Research” (Oliver and Boyd), Table XII in Second Edition. 

G. W. Snedecor : “Statistical Methods,” Iowa State College Press : Fouth Edition 
page 449. 

(V A full discussion with bibliography of nussing plot techniques is by R. L. Anderson : 
Biometrics Bulletin, Vol. 2, No. 3, page 41, 1946. 

{®)F. Yates: Empire Journal of Experimental Agriculture, 1, 129, 1933. 

*) G. W. Snedecor : “Statistical Methods applied to Experiments in Agriculture and 
Biology,” (Iowa State College Press), 4th Edition, 1946. 
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CHAPTER XIII 


BALANCED INCOMPLETE BLOCKS 

(a) Introduction 

A general description of the experimental design known as balanced incom- 
plete bloclp have been given in Chapter I Section (f), and will not be repeated 
here. This chapter will give the methods of computation for such experiments, 
the method and notation being based on Yates(^). 

Yates has developed a rather more sensitive method of analysing the results!^; 
but generally, particularly when the number of blocks in the experiment is not very 
large, the increase in accuracy is not very great, and the earlier method of analysis 
reproduced here will normally suffice. 

(b) Computation 

Table 13.1 gives the results of an experiment for comparing 7 treatments 
in 7 blocks of 3 units, there thus being 3 replications of each treatment. Block A, 
for example, contained treatments Tj, T 3 , and T5, and the figures in the table 
in the columns A to G are the yields obtained. 


Table 13.1 


Treatments 

A 

B 

C 

D 

E 

F 

G 

Totals 

Q 

V 

Corrected 

Treatment 

Means 


SO 

42 






183 

-84 

— 12.000 

57.238 

Iz 



ziS 

94 

94 



306 

163 

23.285 

92.523 

T 3 

76 



64 


80 


220 

69 

9.857 

79 . 09 s 




72 



S3 


156 

— 160 

— 22.857 

46.381 


44 




65 



163 

—81 


57.667 



102 



119 

92 


313 

254 

36.286 

105.524 

T 7 


38 


3® 




II 3 

— 161 

— 23.000 

46.238 

Totals 

170 

iSz 

281 

196 

278 

325 

122 

1454 

0 

0,000 



In Yates’ notation — 


Number of treatments 
Number of units per block 
Number of replications of each treatment 
Number of blocks 
Total number of units 
Sum of all N observations 
Sum of all r observations for treatment 1 
Sum of all k observations for block 1 
Error variance of a single observation within 
blocks of k units 


t = 7 
k= 3 
r = 3 
b == 7 

N = tr = bk = 21 
G = 1454 
Tx 
Bi 


For each of the t treatments we calculate the quantities Q given by 

Q, = kTi — Bi — B2 — . . . — Br 

where the Bs for Qi are the totals for the r blocks containing treatment 1. For 
example, 

Qx = 3 X 183 — 170 — 182 -- 281 = —84 


The Sum of Squares for treatments is then 
t-1 7 — 1 


Nk(k— 1 ) 




21 X 3(3—1) 
= 7665.905 


(842 4. 1632 + . . . + 1612 ) 


(^) F. Yates : Annals of Eugenics, Vol. 7, page 121, 1936. 
(2) F. Yates : Annals of Eugenics, Vol. 10, p. 317, 1940. 
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The Sum of Squares for Blocks is obtained from the Block totals m the usual 
manner : — 

SB* — G*/N 

= j (170* + 182* + ■ . . + 122*) — 1454*/21 = 6725.810 

The Total Sum of Squares is also as usual 

502 + 422 + 912 _]. 1162 + _ . 372 _ 14543/21 = 15057.810 
We enter up these items in a table of analysis of variance and obtain the Error 
Sum of Squares and Degrees of Freedom by difference. 

Table 13.2 


Source of 
Variance 

Degrees of 
Freedom 

Sums of 
Squares 

Mean 

Squares 

Blocks 


6725.810 

1120.968 

Treatments 


7665.905 

1277.651 

Error ! 

1 


666.095 

83.261 

Total i 

20 

15057.810 



The Error Mean Square is and the standard error of the difference between 
any two treatment means is 

''N(k— 1 ) '* ~ V 21(3 _i) X 83.261 

= 8.448 

To form these treatment means, corrected for block differences, we obtain the 
t quantities. 

Q - - 21 ^) (-«") = -“.000 

Th^ ys represent the deviation of each of the t treatments from the grand mean, 
which is G/N = 1454/21 = 69.238. The corrected treatment mean for treat- 
ment 1 is thus 

Vi + G/N = —12.000 + 69.238 = 57.238 

Should we also be interested in the Block means, in the case of designs such 
as the present example, in which every pair of blocks has the same number of 
Treatments in common, we can estimate the corrected block means by calcxilating 
Q's similar to the Qs, e.g. 

Q^a = 3 X 170 — 183 — 220 — 163 = —56 


and V A = Qa = 56 ^ — 

"^NCk — 1) 21(3 — 1) ~ 


- 8.000 


whence the corrected mean for block A is (69,238 + (—8.000)) = 61.238. 
More generally, we obtain the corrected block mean as, here, 

3 (170.000 — 57.238 — 79.095 — 57.667) + 69.238 = 61.238 


In the particular case discussed, where each pair of blocks has the same 
number of treatments in common, the standard error of the difference between 
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corrected block means is the same as that quoted above for corrected treatment 
means. In the more general case, the variance is, on the average. 

A , (t— I)(t — k)N 
t(k-l)(b-l)j 


Vb = 


' 6 

( 


1 + 


which of course gives identical results in the present instance. 


(c) Possible Designs 

A list of possible designs is given in Fisher and Yates’ Tables (Table XVII), 
Broadly speaking, if the number of treatments and the size of block are both 
determined, then the minimum number of replications of each treatment is fixed. 
If we are anxious to have only a small number of replications we can often avoid 
the straightforward solution. For example, working with blocks of 4, to compare 
6 treatments then for balance we require 10 replications. If we are of the opinion 
that 10 replications is more than sufficient for our purpose, we can instead use the 
design for 7 treatments which requires only 4 replications : the seventh treatment 
can be a replication of one of the 6 or an additional one as we choose. 

The most useful designs are that already given (k = 3, b = 7, t = 7, r = 3, 
that is to say,’ 3 imits per block, 7 blocks, 7 treatments, 3 replications) and those 
in Table 13.3. 

Table 13.3 

k = 3, b = 4, t = 4, r = 3 k = 4, b = 5, t = 5, r = 4 


A 

X 


X 

X 

B 

X 

X 


X 

c 

X 

X 

X 


D 


X 

X 

X 


A 

X 

X 

X 

X 


B 

X 

X 

X 


X 

C 

X 

X 


X 

X 

D 

X 


X 

X 

X 

E 


X 

X 

X 

X 


k = 4, b = 7, t = 7, r = 4 



Fisher and Yates’ Tables give a number of other interesting solutions. For 
example, 9 treatments can be compared in blocks of 3 with 4 replications, or 13 
treatments in blocks of 4 with 4 replications, or 16 treatments in blocks of 4 with 
5 replications, or 21 treatments in blocks of 5 with 5 replications. 

(d) Other Uses for Symmetrical Incomplete Blocks 

It might be pointed out that though as indicated these designs were intro- 
duced to evade the difficulty of restricted block size, they can be adapted for 
other purposes. For example, if we had a two factor experiment, each factor 
at 7 levels, complete execution would require 7* = 49 units. We could, however, 
obtain satisfactory estimates of the two factors by use of the design discussed, 
which involves only 21 runs, and which gives the variance of the difference between 
two corrected means as 6ak®/7. The complete experiment would have given it 
as 2crk®/7, so our error in the incomplete case is larger, but may still be small 
enough for our purposes. 
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The alternative classical solution of this problem would be to carry out the 
treatments Ti to Ty with B constant at Ba and Ba to Bg with T constant at Ti 
which would require 13 xmits or runs. In the absence of replication the experi- 
ment is most unsatisfactory, for there is no estimate of error and -no means of 
testing for significance. If we replicate all treatments, making a total of 26 runs, 
w e then get the comparison of our means with a variance o-k^. Comparing with 
the balanced incomplete block arrangement, we have done 24% more work and 
got an error variance 14% larger. 

CHAPTER XIV 

CONFOUNDING : THE PROBLEM OF RESTRICTED BLOCK SIZE 
IN FACTORIAL EXPERIMENTS (i) 


(a) The Algebraic Expressions for Factors 

In Chapter I we mentioned that for factorial experiments of the type 2" 
(n factors all at 2 levels) the number of runs required, being 2^, rapidly increased 
and for 4, 5 and 6 factors is 16, 32 and 64 respectively. These numbers may be 
too large to get into a single block, and the problem then arises of how to split 
the experiment up into smaller units. 

Let us introduce an appropriate symbolism. Let the small letters p, q, r, 
etc., stand for the condition that the factor referred to is at its upper level. For 
example rs would mean that p and q are at their lower levels and r and s at their 
upper levels. Let the capital letter P denote the difference between the sum of 
all the upper levels of P and the sum of all the lower levels of P, and similarly 
for the other factors. Then for a three factor experiment, 

P = pqr + pq + pr + p — qr — q — r ■— 1 
where 1 implies all the factors at their lower levels. Algebraically, this is 
equivalent to 

P = (p_l)(q+l)(r+l) 
and we can show that 

Q-(P+l)(q-l) (r+l), 

R = fp+ l)(q+ l)(r~a 

The first order interaction PQ is given by the difference between P at the 
lower level of Q, namely, 
pr + p — r — 1 

and P at the upper level of Q, namely, 
pqr + pq — qr — q. 

This gives 

PQ = pqr + pq — qr — q — pr — p + r + 1 
and algebraically this is equivalent to 

PQ=(p-l)(q-l)(r+l). 

It will be noted that we get exactly the same result if we regard PQ as the 
difference between Q at the lower level of P, namely, 
qr + q — r — 1 

and Q at the upper level of P, namely, 


pqr + pq — pr — p : 
for this gives 


PQ 


Similarly 


= pqr + pq ~ qr — q -- pr — p 4- r -h 1 
= (P-1) (q-1) (r+i). 


QR = (p + 1) (q — 1) (r — 1) 
RP =:(p_l)(q+l)(r^l). 


(^) For discussions of confounding see R. A. Fisher : **The Design of Experiments,*’ 
Chapter VI (Oliver and Boyd), and F. Yates : *The Design and Analysis of Factorial 
Experiments (Imperial Bureau of Soil Science). 
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The second order interaction PQR is rather more complicated : we can regard 
it as the difference between PQ at the lower level of R and PQ at the upper level 
of R. 

The former is 

(PQ)ri == pq — q — p -f 1 
and the latter is 

(PQ)R 2 = pqr — qr -- pr + r 
so the difference is 

PQR == pqr — qr — pr + r — pq + q + p — 1 

= (P-I).(q-I)(r~-1). 

As with a first order interaction, so with a second order interaction there are 
complementary ways of looking at it ; we might have regarded it as the difference 
between QR at Pi and QR at Pg, or as the difference between RP at Qi and RP 
at Qs- 


(b) Confounding with Three Factors 

Now consider that we have a three factor experiment, which will take 2® = 8 
runs, and we need to put this experiment not into one block of 8 runs but into 
2 blocks of 4 runs, the block size admitting only up to 4 runs. If we consider 
that we are mainly interested in the main eflFects and the first order interactions, 
and are willing to sacrifice ihe second order interaction to achieve this end, we 
can allocate the treatments to the two blocks as in Table 14.1. 


Table 14.1 


Block 1 

Block 2 

pqr 

1 

r 

qr 

P 

pr 

q 

pq 


The second order interaction PQR is now lost as it is identical with the possible 
block differences, for the effect 

pqr 4- r + P + q — 1 — qr — pr — pq 
is both PQR and the difference between Block 1 and Block 2. This confusing 
of effects with block differences is known as “confounding.’’ 

We have now to assure ourselves that our estimates of the main effects and 
first order interactions are not affected by possible block differences. Suppose 
that Block 2 has a bias such that all the four results contained in it come out d 
higher than they would have done if they had been in Block 1. 

We have 

P = pqr H- (pq + d) + (pr + d) + p — (qr + d) — q — r ■— (1 + d) 

= pqr + pq + pr + P — qr — q — r — 1 
as before, the biases d vanishing, and similarly for the two other main effects. 

Also 

PQ = pqr + (pq + ci) — (qr + d) — q — (pr + d) — p + r + (1 + d) 
= pqr +pq — qr — p — pr — p + r+ 1 
as before, the biases d again vanishing, and similarly for the other two first order 
interactions. 

(c) Confounding with Four Factors 

With four factors, the total number of runs required is 2^ = 16. The main 
effects are of the type 

p == (p __ 1) (q + 1) (r + 1) (s + 1), 
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the first order interactions of the type 

PQ = (p-l)(q-l)(r+l)(s+ 1), 

the second order interactions of the type 

PQR= (p-l)(q-l)(r-l)(s+ 1), 
and the third order interaction is 

PQRS = (p _ 1) (q - 1) (r - 1) (s 1). 

If we expand the latter it becomes 

PQRS = pqrs — pqr — pqs + pq — prs + pr + ps — p — qrs 
+ qr + qs — q + rs — r — s + 1. 

If we wish to confound in 2 blocks of 8, and select PQRS as the term to be 
confounded, then we merely need to place all those terms with a + sign in one 
block and all those with a — sign in the other block, as in Table 14.2. 


Table 14.2 


Block 1 
Block 2 


pqrs + pq + pr + ps + qr + qs + rs + 1 
pqr + pqs + prs + qrs + p + q + r + s 


To confound in 4 blocks of 4, we need to select 3 terms to correspond to the 
3 degrees of freedom that we confound with block differences. It is found that 
while we can select any two arbitrarily, the third is then xmiquely determined. 
The third is determined by the rule that we multiply the first two together by 
the ordinary laws of algebra, but wherever a squared term appears we place it 
equal to unity. Thus if the two selected are PQ and RS, then the third is PQRS : 
if the two selected are PQR and QRS, then the third is PS. 

A little experiment will show that with 4 factors to be confounded in 4 blocks 
of 4 it is impossible to avoid including one first order interaction amongst the 
three that are to be confounded. 

One method of dividing the treatments into their appropriate blocks is to 
expand two of the selected interactions which are being confounded, as in Table 14.3. 


Table 14.3 


PQR 

QRS 

pqrs — pqs + pqr — pq — qrs + qs — qr + q 
pqrs — pqs — pqr + pq + qrs — qs — qr + q 

PQR cont 
QRS cont 

— prs + ps — pr + p + rs — s + r — 1 

— prs + ps + pr — ^ p — rs + s + r — 1 


Where one 
in one block. 
Table 14.4. 


treatment has the same plus sign in both expansions, we place it 
Similarly , H , and h form the other blocks, as in 


Table 14.4 


Block 1 
+ + 

Blopk 2 

Block 3 
+ — 

Block 4 
— + 

pqrs 

pqs 

pqr 

pq 

q 

qr 

qs 

qrs 

ps 

prs 

P 

Pr 

r 

1 

rs 

s 
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There is an alternative method of writing down the treatment which in the 
more complicated cases is less troublesome. We consider the three interactions 
which we have selected for confounding, here 

PQR, QRS, PS. 

We place in the first block those treatments which have an even number of letters 
in common with the letters of these three terms. The treatment (1) is obviously 
satisfactory, for it has none in common (and zero is an even number). The 
treatment q r is also satisfactory, for it has 2 letters in common with both PQR 
and QRS and none in common with PS. The two others satisfying the condition 
are prs and pqs. This has given us one block, actually equivalent to Block 2 in 
Table 14.4. To get the next block, we select any treatment which has not yet 
occurred, say p, and multiply each of the terms in our first block by this, using 
the convention as before that e.g. p® = 1. Thus we get 
pqs X p = p,2 qs = qs 
qr X p = qpr 
prs X p = p^ rs = rs 
1 X p = p 

Accordingly, (p, rs, qpr and qs) form our second block, which actually is Block 3 
of Table 14.4. The next block is formed by selecting a treatment which has not 
yet occurred, say q, and multiplying the term of the first block in turn. This will 
give us Block 1 of Table 14.4. Selecting any treatment not yet mentioned, say s, 
and carrying out the same operation given as Block 4. 

(d) Confounding with Five Factors 

With five factors we require a total of 2® = 32 runs. Accordingly there are 
a number of possibilities according to the size of block we have available. 

For splitting into 2 blocks of 16 we obviously choose to confound the highest 
order interaction, PQRST. To allocate the treatments into the two blocks, we 
can expand (p — 1) (q — 1) (r — 1) (s — 1) (t — 1) and place all treatments with 
a + sign in one block and all those with a — sign in the other block. Alter- 
natively, we get the same result by taking for one block all those treatments with 
an even number of letters in common with PQRST : these are (1) (no letter in 
common), the ten treatments of the type pq, qr, pr, ps, etc., and the five treatments 
of the type pqrs, qrst, etc. 

For splitting into 4 blocks of 8, the best set of interactions to be selected for 
confounding is of the type 

PQR, RST, PQST 

which avoids losing any first order interaction. For allocation into the four 
blocks, we can expand any two, say PQR and RST, and distribute the treatments 

according to whether their signs in the two expansions are + +, + — » b, or 

, as in the previous section. Alternatively, we can take as the first block 

those treatments with an even number of letters in common with the interactions 
selected for confounding. These are 

(1), pq, St, pqst, qrs, qrt, prs, prt. 

The subsequent blocks are obtained by multiplying each of these treatments by 
p for the second block, s for the third block, and r for the fourth blocL 

For splitting into 8 blocks of 4, the best we can do is to lose 2 first order 
nteractions. A typical set is 

PQ, RS, PRT, QRT, PST, QST, PQRS 

One method of building up the treatment combinations for the eight blocks is to 
expand any three (the third not being the product of the first two : e.g. if we 
select PQ and RS as the first two, then the third cannot be PQRS) and flien the 
eight blocks are made up according to whether a treatment has the signs in the 
tluee expansion + + +> + H > H > H h +> 1 > > 
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or h- The alternative method is, however, less arduous. Our first block 

would be 

(1), pqt, rst, pqrs. . . , . ^ 

and the other seven are obtained by multiplymg by treatments not hitherto 
mentioned. For example, to get our next block we can multiply by p, getting 

p, qt, prst, qrs. 

For the third block we can multiply by q, getting 

q, pt, qrst, prs, 

and so on, till all treatments have occurred. 

(e) Confounding with Six Factors 

With six factors we require 2® = 64 runs. In confoxmding in 2 blocks of 32 
we obviously choose the highest order interaction PQRSTU to be confounded. 
For four blocks of 16 the most satisfactory set for confounding is 
PQRS, RSTU, PQTU, 
and for 8 blocks of 8 

PQR, RST, PSU, QTU, PQST, QRSU, PRTU. 

In all these cases we avoid losing any first order interaction. To get the 64 runs 
into 16 blocks of 4, however, losing 3 first order interactions is unavoidable. A 
typical set of 15 interactions for confounding is 

PQ, RS, PRT, QRT, PST, QST, PQRS, TU, PQTU, RSTU, 
PRU, QRU, PSU, QSU, PQRSTU. 

For allocating the treatments to their appropriate blocks the same method 
as hitherto can be applied. The first block is 
(1) pqrs, rstu, pqtu 

and the subsequent blocks can be obtained by mutliplication. 

(f) Computation of the Results of a Confounded Experiment : 

An Example 

In a confounded experiment the computation proceeds exactly as though the 
experiment had not been confounded, with the sole difference that we do not 
compute those interactions which we have confounded. Instead, we take the 
block totals, square them, divide by the number of treatments occurring in each 
block, and subtract the usual correcting factor (grand total squared over grand 
total number of observations). The degrees of freedom for this sum of squares 
is of course one less than the number of blocks. We would get an identical result 
if we calculated the sum of squares for the interactions being confounded, and 
then pooled them. 

As an example, let us consider an experiment on a process employing a 
mixed catalyst made up of three components. We wish to vary catalyst com- 
ponent A at 4^ levels and components C and D each at 2 levels, ^d we measure 
the yield as the dependent variable. A straightforward factorial experiment 
would require 4 X 2 x 2 = 16 nms. However, it was necessary to confound 
the experiment in blocks of 4, 4 blocks thus being required. 

The first problem is how to confound with a factor at 4 levels. Yates has 
given a method by which it can be regarded as two factors each at 2 levels. If 
we regard the two factors as A' and A",, with suffices 1 and 2 to indicate their 
lower and upper levels, and if the four levels of the four level factor are to a 4 , 
then Table 14.5 gives the appropriate allocation. 

Table 14.5 



A'l A% 

A"x 

A"x 

3,2 3.8 

3i a4 
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With this allocation 

== ^4 + 3-3 — ^2 — 2ll 

A = 3.4 as — 3,2 -{- 3i 

and the interaction of A' and A", which we may call A'", is 
A'" = 34 — ag + ag — ai 

Accordingly, the linear component is 2A' + A"\ the quadratic component 
is A , and the cubic component is — A! . For confounding we assume 
that the cubic component is negligible and confound A'^', this leaving us estimates 
of the linear and quadratic components. 

Reference to Section (c) will show that with four factors (all at two levels) 
being confoimded one first order interaction must be lost : we select A”' as this 
(equivalent to A' A"). The other two most suitable interactions for confounding 
are clearly A'CD and A" CD. 

The allocation of treatments to the four blocks proceeds in the usual manner, 
and is set out with the yields obtained in Table 14.6. 


Table 14.6 


Block 1 


(1) 24 

cd 31 

a'a"d 6 

a'a"c 14 


75 


Block 2 


a' 25 

a'cd 23 

a"d 24 

a"c 36 


108 


Block 3 


a" 56 

a"cd 41 

a'd 27 

a'c 40 


164 


Block 4 


c 48 

d • 29 

a'a"cd 23 

a'a" 25 


125 


The symbolism for treatments is such that, for example, a'a" implies both 
A' and A" at their upper levels ; reference to Table 14.5 shows that A\A'\ corre- 
sponds to a 4 , i.e, the highest level of the four-level factor. Factors C and D are 
of course at their lower levels. The treatment a"cd has C and D at their upper 
levels ; a" implies A'lA''^ (i.e. A' at its lower level and A" at its upper levd), and 
Table 14.5 gives this as ai. 

The last line in Table 14.6 gives the block totals, and the sum of squares 
for Blocks is of course 

1 (75* + 108* + 164* + 125*) — = 1028.500 

and this sum of squares has 3 degrees of freedom. 

The remainder of the analysis proceeds in the usual fashion. It is interesting 
to note that if we calculate the three interactions confounded, we get for their 
sums of squares — 

A'A" 324.000 

A'CD 2.250 

A"CD 702.250 

and this total agrees with that obtained above. 

If we wish to obtain a check on the correctness of the computation, we can 
calculate the Ughest order interaction, A'A"C D, directly rather than the usual 
manner of obtaining it by difference. We expand (a' — 1) (a" — 1) (c — 1) 
(d — 1 ) as a'a"c d -f* a'c + a'd -f- a"c 4 " a'a" -|- cd -|- 1 — a'cd — a"cd 

— a'a"c — a'a"d — a' — a"— c — d. 
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Going through the table of results (Table 14.6) we obtain as this total ( — 12). 
The sums of squares for A' A" CD is the square of this result divided by the number 
of observations concerned, or ( — 1 2)^/1 6 = 9.000. This checks with the figure 
found by difference from the total sum of squares. In any factorial experiment 
with factors at 2 levels it is open to us, if we prefer, to calculate all the sums of 
squares in this manner. (^) 

The table of analysis of variance is in Table 14.7. 


Table 14.7 


1 

Source of Variance 

Degree of 
Freedom 

Sums of 
Squares 

Blocks 

3 

1028.500 

. r Linear Component A' 

1 

702.250 

1 Quadratic Component A'' 

1 

30.250 

c 

1 

100.00 

D 

1 

256.000 

A'C 

1 

2.250 

A'D 

1 

12.250 

A"C 

1 

72.250 

A"D 

1 

6.250 

CD 

1 

36.000 

A'A"C 

1 

36.000 

A'A"D 

1 

25.000 

Residual 

1 

9.000 

Total 

1 

15 

2316.000 


The interpretation of this table is quite straightforward. The Residual, 
which of course is A'A"C D, can be pooled with the two second order interactions 
A'A''C and A'A"D to give a mean square of 23.333 with 3 degrees of freedom 
which we use for error. We then find all the first order interactions to be non- 
significant. Of the main effects, D is significant at the 1% level, C non-significant, 
the linear component of A is significant at the 0,1% level and the quadratic com- 
ponent of A, measuring the possible curvature, is non-significant. We can then 
proceed to calculate the relevant averages and attach the appropriate standard 
errors to them. 

It is noteworthy in Table 14.7 that the Mean Square for Blocks, 1028.000/3 
= 342.667, is highly significant (0.1%). With confounding the estimate of error 
obtained by pooling all the non-significant first order interactions is 18.178 (with 
7 degrees of freedom). If confounding had not been used, but instead the 16 
treatments allocated at random, then the sum of squares due to difference between 
blocks would have appeared in this pooled residual error, which would then have 
been 115.575 with 10 degrees of freedom. The gain in accuracy through the use 
of confounding, which reduced the error mean square to 18.178, is thus very 
great, and without it the experiment would have been almost valueless. 

Finally, it is of interest to note how such an experiment as this could have 
been tackled traditionally. The best that could have been done would be to put 
the four level factor A into two blocks, C and D being constant. The third block 
could have contained C replicated twice (two observations at each of the two 
levels), A and D being constant, and the fourth block could have been used similarly 

(^) A systematic method of making these calculations is given by F. Yates ‘Design and 
Analysis of Factorial Experiments” (Imperial Bureau of Soil Science), pages 15 and 28. 
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for D. Such an experiment would only give half the accuracy for estimating A* 
-md one quarter the accuracy for estimating C and D as the factorial confounded 
experiment. Further, it would have given no estimates at all of the possible 
existence of interactions. The gain through the use of the factorial confounded 
design is thus very considerable. 

(g) Partial Confounding 

It will be clear that if we cannot afford to lose any of the first order inter- 
actions,^ and the confounding that we are forced to adopt through the restricted 
block size is going to lose one or more of them, then it we replicate further we 
can choose to confound interactoins other than those which we lost in the first 
replication. This technique is known as “partial confounding.’' Details will be 
found in 

Yates : “Desi^ and Analysis of Factorial Experiments” (Imperial Bureau 
of Soil Science). Sections 4 and 5. 

Goulden : “Methods of Statistical Analysis” (John Wiley). Page 162. 

(h) Other Confounding Possibilities 

Yates (c.f. reference in preceding section) has set out the possibilities of 
confounding when factors at more than two levels are concerned. Amongst the 
schemes he describes are the confounding of the 3 X-3 X 3 and the 3 X 3 X 3 X3 
experiments in respectively 3 and 9 blocks of 9 runs. Mixed experiments of the 
type 3x2x2, 3x2x2x 2, 3x3x2, and 3x3x3x2all confounded 
in blocks of 6 runs are also described. 

(i) Double Confounding (®) 

Sometimes in a factorial experiment we have a double restriction imposed 
on the system. For example, in a 2® experiment (5 factors all at 2 levels, requiring 
a total of 32 runs) we may have 4 reactors which may be different. Accordingly 
we confound in 4 blocks of 8, there being 8 runs on each of the 4 reactors. How- 
ever, our batches of raw material may not be large enough to carry out the whole 
experiment. We then need to confoimd the experiment in an additional way. 
The problem is soluble if we make it the inverse of the first type of confounding. 
Thus, if the first confounding is in 4 blocks of 8 then the second must be in 8 blocks 
of 4. There is the further restriction that no interaction confounded in the 
second set must occur in the first set, and vice versa. 

For the confoimding in 8 blocks of 4 let us select for confounding 
PQ, RS, PRT, QST, QRT, PST, PQRS, 
and for the confounding in 4 blocks of 8 we can choose 
PQR, RST, POST, 

none of the second set occurring in the first. 

To allocate the treatments, we can first set out them in the usual manner for 
confounding in 8 blocks of 4, as in Table 14.8. 


Table 14.8 


Batch 1 

Batch 2 

Batch 3 

Batch 4 

Batch 5 

Batch 6 

Batch 7 

Batch 8 

(1) 

P 

q 

r 

s 

t 

pr 

qr 

pqrs 

qrs 

prs ^ 

pqs 

pqr 

pqrst 

qs 

ps 

rst 

prst 

qrst 

St 

rt 

rs 

p^ 

qst 

pqt 

qt 

pt 

pqrt 

pqst 

pq 

rqt 

prt 


(*) c.f. R. A. Fisher, “The Design of Experiments,” 3rd edition, p. 116. 
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We do likewise for the 4 blocks of 8, as in Table 14.9. 


Table 14.9 


Reactor 1 

(1) 

pq 

st 

prst 

qrt 

prt 

prs 

qrs 

Reactor 2 

P 

q 

pst 

qst 

pqrt 

rt 

rs 

pqrs 



pqs 

t 

pqt 

qrst 

prst 

Pr 

V 

Reactor 4 

r 

pqr 

rst 

pqrst 

qt 

Pt 

ps 

qs 


We now superimpose Tables 14.8 and 14.9 on top of each other. Treatment 
(1) belongs to Batch 1 and Reactor 1, and is entered in Table 14.10 accordingly. 
Treatment pqrs belongs to Batch 1 and Reactor 2, and is entesed thus. Treatment 
rst belongs to Batch 1 and Reactor 4, and so on. 


Table 14.10 



Batchi 

Batch 2 

Batch 3 

Batch 4 

Batch 5 

Batch 6 


BatchS 

Reactor i 

(1) 

qrs 

prs 

8t 

pq*t 

pq 

qrt 


Reactor 2 

pqrs 

P 

q 

pqrt 

rt 

rs 

pst 


Reactor 3 

pqt 

prst 

qrst 

pqs 

8 

t 

pr 

qr 

Reactor 4 

xst 

qt 

Pt 

r 

pqr 

pqrst 

qs 

P» 


Table 14.10 will be found to satisfy the condition of both Tables 14,8 and 
14.9, and accordingly is the correct allocation of treatments. The analysis of 
variance proceeds normally, all the interactions being confounded not being 
mentioned specifically in the table of analysis of variance but their Sums of Squares 
and Degrees of Freedom being lumped together under the description “Blocks.” 

Ti; will be noted that, if we were confounding in 8 blocks of 4, the imposition 
of the second restriction leads to no increase in the number of first order interactions 
being lost. 

Double confounding is not very practical in the 2* experiment, but works 
well in the 2® as described above, and also in the 2®, where there are two possi- 
bilities, either confoxmdmg in 8 blocks of 8 and 8 blocks of 8 or in 16 blocks of 4 
and 4 blocks of 16. 

For the former case, one set of 8 interactions for confounding in 
PQRS, RSTU, PQTU, PRT, QST, PSU and QRU 
and another non-overlapping set is 

PQRU, PRST, QSTU, PQS, RSU, RQT and PTU. 

Thus the double restriction can be applied without losing any first order 
interactions. 

For double confounding in 16 Blocks of 4 and in 4 blocks of 16, one set of 
interactions for confounding is 

PQ, RU, PRT, QST, QRT, PST, PQRS, TU, 

PQTU, RSTU, PRU, QSU, QRU, PSU, PQjRSTU, 
and the other set, non-overlapping, is 
PQRU, PRST, QSTU, 

Accordingly, if we are confounding in 16 Blocks of 4, we lose 3 first order 
interactions, and the introduction of the second restriction in no way increases 
the loss, so can always be carried out when convenient or necessary. 
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CHAPTER XV 


GENERAL CONCLUSIONS 
(2l) Investigation of Multi- Variable Processes 

The reader vdll have noticed that we have discussed two diflerent techniques 
for the investigation of multi-variable S5rstenis, namely multiple correlation and 
the factorial experiment. 

Multiple correlation is appropriate where we have a considerable mass of 
data on the system under investigation, the various independent variables being 
at random and at all levels. Such data arises from the normal running of a plant : 
in particular, on a modem plant with automatic recorders kept in proper working 
condition such data should be very easily collected. 

The factorial experiment is appropriate where we can make a deliberate 
experiment upon a system, controlling the independent variables at the levels 
laid down by the factorial design. The execution of such an experiment generally 
calls for careful supervision and hence tends to be relatively costly. 

The multiple correlation approach suffers from the disadvantage that initially 
one is restricted to the assumption that the dependent variable is a linear function 
of the independent variables which are also assumed to be acting independently. 

It is, however, possible to evade some of these difficulties arising from non- 
linearity by use of the various forms of transformation mentioned in Chapter 
IX (b) and Chapter X (b). It will be noted that the satisfactory use of ihese 
devices is based on our initially selecting the appropriate transformation, either 
from physico-chemical knowledge of the system, or from an intelligent or lucky 
inspection of the data. If we have to try first one transformation, then another 
and so on till we get a good fit the amount of work is excessive. 

The dfficulty with regard to the independence of the independent vambles 
can be partially circumvented by the addition of product terms as additional 
independent variables, e.g, if our dependent variable is y and we have independent 
variables Xi and x* we can include the third variable (xi x*), i.e. we fit an equation 
of the form 

y = bi Xi + b, X, + bi Xf) . , , 

However, if there are three independent variables and we wish to cnecK on the 
possible interactions of all of them, the equation we fit is 

y = bi Xi + bj Xj -h bs Xa + bx 1 (xi Xf) + ba a (xi Xa) + ba i (Xa Xj) 
The labour of calculating all the relevant sums of squares and sums of products 
and then solving the six simultaneous equations would be very heavy and only 
in very important cases could it be justified. 

' « With the factorial experiment, our initial assumptions ve^ gaieral and 

we are quite prepared to ‘see any of the first order interactions significant, and 
we have made no assumptions as to linearity. This generahty ineans that a factorial 
experiment could only give confusing results if every factor was interactmg strongly 
with all the others, a very rare occurrence, and even then one which could probably 
be made clearer by a logarithmic transformation. Further, if some of our factors 
are at more than two levels we can readily check on the possible non-lineanty of 
the effects by the methods of Chapter XII (c). The total computation required 
is very much less than with multiple correlation. 

The factorial hag. fetiw a^^titage &at it c^ generally be 

appreciably m accuracy by the device of confounding. 

Taking ah bretall yieWi there cah be no ddiibt that the factorial experiinent 
is to be greatly preferred to an examination of existing data by mutUple corr^tion. 
In addition to all the adv^ges set out above, the experiment has the further 
advantage that it is not liable to error through the presence of unsuspected corre- 
lated variables: this is the distinction between experiment and observation 
discussed in Chapter IX (k). 
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In the intermediate case, where some variables can be controlled at the levels 
laid down by a factorial design but where some of them are more difficult to 
control, the most appropriate technique would be covariance : there are no 
published references referring to industrial problems but the technique is described 
in the references below. (^) 

(b) The Advantages of Planning Experiments 

It will have been apparent that none of these statistical designs for experi- 
ments can be applied without the appropriate forethought and planning. They 
cannot be superimposed on a hotch-potch of random results. 

The implication of this are rather far-reaching. For example, the traditional 
style of experimenting by trying first one thing, then if that does not work then 
something else, and so on, in fact the policy of proceeding by small steps, is quite 
unsuitable. On the contrary, we require to sit down and appraise all the points 
that appear to be of interest, or likely to be of interest in the future, and then 
experiment on them all simultaneously. We thus move in jumps rather than in 
steps. The great advantage of moving in jumps is that the information given 
by a jump adds up to very much more than that given by its component steps, 
though requiring no more effort. 

To take an example of this, consider the case discussed in Chapter XIII (a) 
where we tested 7 treatments in blocks of 3. The variance of the comparison of 
any two corrected means was (6/7) ak\ = 0.857ak^, where ok* was the error 
variance of a single observation within blocks. 

Now suppose that instead of starting off the experiment with the full number 
of treatments (7) we had begun with only 4 and had afterwards decided to test 
the remaining 3. Our first 4 could be tested with the design given in Table 13.3, 
when the variance of the comparison of any two of these means would be 

•7 V ^ ft ~ ^) _ a _ 2 V 3 (4 — 1) _ 

^ ^ N (k— 1) ^ ^ 3 X 4 (3 — 1) 

This compares favourably with the figures of 0.857ak2 for the whole experiment. 
However, when we come to the 3 additional treatments, it is impossible to execute 
them in such a manner that when added to the existing data we get the same 
design as that used for testing 7 treatments. Accordingly the best we can do is 
to select one treatment as a ‘‘standard’" (say treatment T 4 ) to occur in both experi- 
ments, and with this standard and the remaining 3 treatments we carry out a 
further balanced incomplete block of 4 treatments in blocks of 3, requiring 3 
replications. The error variance of the comparison of any two of the four treat- 
ments T4 to T 7 in 0.75<Tk^ as it was for any two of the four treatments Ti to T4. 
However, when we wish to compare any of the three treatments Tj to Tg with 
any of the three Tg to T?, this can only be done via T 4 , and the error variance 
of these comparisons is 2 X 0.75ak® = l.ScrkS nearly double that of 0.857<Tk* for 
the experiment in one stage. The overall average variance of a comparison in 
the two stage experiment is actually 1.071ak^. 

When we consider that the experiment in two stages has taken 24 runs as 
compared with 21 for the experiment in one stage, it is evident that the latter 
has been much more satisfactory. 

In parenthesis, a comparison might be made with the classical type of experi- 
ment using a single standard throughout. The nearest approach to a balanced 
design calls for 6 blocks, all containing the standard and two of the remaining 6 
treatments. Each of the latter therefore occurs twice. The total number of runs 

(^) For a description of the method of covariance see R. A. Fisher : ^‘Statistical Methods 
for Research Workers/^ Section 49.1 (Oliver and Boyd). K. Mather : “Statistical Analysis 
in Biology/* Section 34 (Methuen). G. W. Snedecor : “Statistical Mefiiods,” Chapter 12 
(Iowa State College Press). 
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Tequired is 6 X 3 = 18, which is sUghtly less than the 21 for the balanced incom- 
plete blocte. The accuracy is very much less. There are three separate types 
of companson m this classical design : — ^ 

(a) Between any treatment and the standard. These have variances 0.667ofc* 

(b) Between any two treatments which occur in the same block. These have 
variances 1.333ark^ 

(c) Between any two treatments not in the same block. These have variances 
^proximately 4.0ok2. The overall average is approximately 2.285ak*. 
The comparison of this classical design with a “standard” with the 
balanced incomplete block is thus seen to be very unfavourable to the - 
former, when it is remembered that in the latter all comparisons have the 
variance O.SSTok^ 

In the same manner, it can be shown that one large factorial experiment is 
.much more satisfactory than two smaller ones. For example, with 6 factors all 
at 2 levels, it is rnuch better to have then in one full experiment requiring 2* = 64 
mns(*) than to divide the experiment into two parts, each with 3 factors, requiring 
for the two parts a total of 2 x 2® = 16 runs. The former requires appreciably 
more work, but in return yields the following advantages : — 

(a) The main effects are determined with 8 times the accuracy. 

(b) All the 15 possible first order interactions are evaluated with considerable 
accuracy ; the two stage experiment only estimates 6 of these 15, and 
then only with one-eighth the accuracy. 

(c) All the 20 possible second order interactions are estimated : in the two 
stage experiment only 2 are estimated, and then these estimates have to 
be used as error so are of no value as regards estimating the interactions 
themselves. 

(d) All the 15 possible third order interactions are estimated. There are 
still 7 degrees of freedom left for error : if we use these plus the greater 
part of the 15 third order interactions for error we will have the latter 
with about 20 degrees of freedom. This is much more satisfactory than 
in the two-stege experiment where we start off with a second order 
interaction with only 1 degree of freedom as error. 

(c) Conclusions 

These sections on the design of experiments have been written rather as 
though it were a fully-developed subject like Euclid’s Geometry. This is not the 
case, however : these statistical designs have hitherto only been employed in the 
chemical indxistry to a very limited extent, their development taking place in the 
science of agriculture, where there is now a body of experience available stretching 
over twenty years. The methods are clearly capable of transplantation on the 
lines indicated, but we must guard against an imdue literalism in their application. 
In the chemical industry certain aspects will need to be developed, and others 
perhaps to be modified as not suitable in the changed conditions. Thus, for 
example, it seems that for the presentation of final results oh which action may 
be taken the 5% level of significance is inadequate, and we really require the 
1% or better. In general, therefore, caution should be used in their interpretation 
till there is a greater foundation of experience, but to date no incident 
has come to the author’s notice which suggests that there are any pitfalls. 


(•) Methods have been developed recently for factorial experiments with large numbers 
of factors which allow us to obtain satisfactory estimates of the main effects and lower 
order (first and second) interactions with only a fraction of the fiill number of runs ; c.f. 
D. J. Finney, Annals of Eugenics, Volume 12, page 291, 1945. 
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TABLE II 


Table of 


Degrees 

of 

Freedom 

- ^ 

0.99 

0.98 

0.95 

0.90 

0.50 

flQUI 

o.os 

0.02 

0.01 

0.001 

1 

0.000 

0.001 

0.004 

0.015 

0.455 

2.71 

3.84 

5.41 

6.64 

10.83 

2 

0.020 

0.040 


0.211 

1.386 

4.61 

5.99 

7.82 

9.21 

13.82 

3 

0.115 

0.185 


0.584 

2.366 

6.25 

7.82 

9.84 

11.34 

16.27 

4 

0.297 

0.429 



3.357 

7.78 

9.49 

11.67 

13.28 

18.47 

5 

0.554 

0.752 

1.145 


4.351 

9.24 

11.07 

13.39 

15.09 

20.52 

6 

0.872 

1.134 

1.635 

2.204 

5.35 


12.59 

15.03 

16.81 

22.46 

7 

1.239 

1.564 

2.167 

2.833 

6.35 


BEm 

16.62 

18.48 

24.32 

8 

1.646 

2.032 

2.733 

iSESl 

7.34 

13.36 

15.51 

18.17 

20.09 

26.13 

9 

2.088 

2.532 

3.325 

4.168 

8.34 

14.68 

16.92 

19.68 

21.67 

27.88 

10 

2.558 

3.059 

3.940 

4.865 

9.34 

15.99 

18.31 

21.16 


29.59 

11 

3.05 

3.61 

4.57 

5.58 

10.34 

17.28 

19.68 

22.62 

24.73 

31.26 

12 

3.57 

4.18 

5.23 

6.30 

11.34 

18.55 


24.05 

26.22 

32.91 

13 

4.11 

4.76 

5.89 

mESM 

12.34 

19.81 

22.36 

25.47 

27.69 

34.53 

14 

4.66 

5.37 

6.S7 

7.79 

13.34 

21.06 

23.69 

26.87 

29.14 

36.12 

15 

5.23 

5.99 

7.26 

8,55 

14.34 

22.31 

25.00 

28.26 

30.58 

37.70 

16 

5.81 

6.61 

7.96 

9.31 

15.34 

23.54 

26.30 

39.63 

32.00 

' 39.25 

17 

6.41 

7.26 

8.67 


16.34 

24.77 

27.59 


33.41 

40.79 

18 

7.02 

7.91 

9.39 


17.34 

25.99 

28.87 

32.35 

34.81 

I 42.31 

19 

7.63 1 

8.57 

10.12 

11.65 

18.34 

27.20 

30.14 

33.69 

36.19 

43.82 

20 

8.26 

9.24 i 

10.85 

12.44 

19.34 

28.41 

31.41 

35.02 

37.57 

45.32 

21 

8.90 

9.91 

11.59 

13.34 

20.34 

29.61 

32.67 

36.34 

38.93 

46.80 

22 

9.54 

10,60 

12.34 

mimm 

21.34 

30.81 

33.92 

37.66 

40,29 

48.27 

23 

10.20 

11.29 

13.09 

14.85 

22.34 

32.01 

35.17 

38,97 

41.64 

49.73 

24 

10.86 

11.99 

13.85 

15.66 

23.34 

33.20 

36.42 

40.27 

42.98 

51.18 

25 

11.52 

12.70 

14.61 

16.47 

24.34 

34.38 

37.65 

41.57 

44.31 

52.62 

26 

12.20 

13.41 

15.38 

17.29 

25.34 

35.56 

38.89 

42.86 

45.64 

54.05 

27 

12.88 

14.12 

16.15 

18.11 

26.34 

36.74 

40.11 

44.14 

46.96 

55.48 

28 

13.56 

14.85 

16.93 

18.94 

27.34 

37.92 

41.34 

45.42 

48.28 

56.89 

29 

14.26 

15.57 

17.71 

19.77 

28.34 

39.09 

42.56 

46.69 

49.59 

58.30 

30 

14.95 

16.31 

18.49 

20.60 

29.34 

40.26 

43.77 

47.96 

50.89 

59.70 


Abridged from Table IV 
Medical Research.” 


of “Statistical Tables for Biological, Agricultural and 
(R. A. Fisher and F. Yates : Oliver and Boyd). 
























TABLE III 

Tables of Variance Ratio (i) 


\ Nx 

0.20 Significance Level 


B 

2 

3 

4 

5 

6 

12 

24 


1 

9.5 

12.0 

13.1 

13.7 

14.0 

14.3 

14,9 

15.2 

15.6 

2 

3.6 

4.0 

4.2 

4.2 

4.3 

4.3 

4.4 

4.4 

4.5 

3 

2.7 

2.9 

2.9 


3.0 

3.0 

3.0 

3.0 

3.0 

4 

2,4 

2.5 

2.5 

2.5 

2.5 

2.5 

2.5 

2.4 

2.4 

5 

2.2 

2.3 

2.3 

2.2 

2.2 

2.2 

2.2 

2.2 

2.1 

6 

2.1 

2.1 

2.1 

2.1 

2.1 

2.1 

2.0 

2.0 

2.0 

7 

2.0 

2.0 

2.0 

2.0 

2.0 

2.0 

1.9 

1.9 

1.8 

8 

2.0 

2.0 

2.0 

1.9 

1.9 

1.9 

1.8 

1.8 

1.7 

9 

1.9 

1.9 

1.9 

1.9 

1.9 

1.8 

1.8 

1.7 

1.7 

10 

1.9 

1.9 

1.9 

1.8 

1.8 

1.8 

1.7 

1.7 

1,6 

11 

1.9 

1.9 

1.8 

1.8 

1.8 

L8 

1.7 

1.6 

1.6 

12 

1.8 

1.8 

1.8 

1.8 


1.7 

1.7 

1.6 

1.5 

13 

1.8 

1.8 

1.8 

1.8 


L7 

1.6 

1.6 

1.5 

14 

1.8 

1.8 

1,8 

1.7 

■B 

1.7 

1.6 

1.6 

1.5 

15 

1.8 

1.8 

1.8 

1.7 

1.7 

1.7 

1.6 

1.5 

1.5 

16 

1.8 

1.8 

mt 

1.7 

1.7 

1.6 

1.6 

1.5 

1.4 

17 

1.8 

1.8 

mSM 

1.7 

1.7 

1.6 

1.6 

1.5 

1.4 

18 

1.8 

1.8 

1,7 

1.7 

1.6 

L6 

1.5 

1.5 

1.4 

19 

1.8 

1.8 

1.7 

1.7 

1.6 

L6 

1.5 

1.5 

1.4 

20 

1.8 

1.8 

1.7 

1.7 

1.6 

L6 

1.5 

1.5 

1.4 

22 

1.8 

1.7 ' 

1.7 

1.6 

1.6 

1.6 

1.5 

1.4 

1.4 

24 

1.7 

1.7 

1.7 

1.6 

1.6 

1.6 

1.5 

1.4 

1.3 

26 

1.7 

1.7 

1.7 

1.6 

1.6 

1.6 

1.5 

1.4 

1.3 

28 

1.7 

1.7 

1.7 

1.6 

1.6 

1.6 

1.5 

1.4 

1.3 

30 

1.7 

1.7 

1.6 

1.6 

1.6 

1.5 

1.5 

1.4 

L3 

40 

1.7 

1.7 

1.6 

1.6 

1.5 

1.5 

1.4 

1.4 

1.2 

60 

1.7 

1.7 

1.6 

1.6 

1.5 

1.5 

1.4 

1.3 

1.2 

120 

1.7 

1.6 

1.6 

1.5 

1.5 

1.5 

1.4 

1.3 

1.1 

00 

1.6 

1.6 

1.6 

1.5 

1.5 

1.4 

1,3 

1.2 

1.0 


Abridged from Table V of ‘‘Statistical Tables for Biological, Agricultural and 
Medical Research.” (R. A. Fisher and F, Yates : Oliver and Boyd). 
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Table of Variance Ratio (ii) 


Ni 

N, 

0.05 Significance Level 

1 

2 

3 

4 

5 

6 

12 

24 

00 

1 

164,4 

199.5 

215.7 

224.6 

230.2 

234.0 

234.9 

249.0 

254.3 

2 

18.5 

19.2 

19,2 

19.3 

19.3 

19.3 

19.4 

19.5 

19.5 

3 

10.1 

9.6 

9.3 

9.1 

9.0 

8.9 

8,7 

8.6 

8.5 

4 

7.7 

6.9 

6,6 

6.4 

6.3 

6.2 

5.9 

5.8 

5.6 

5 

6.6 

5.8 

5.4 

5.2 

5.1 

5.0 

4.7 

4.5 

4.4 

6 


5.1 

4.8 

4.5 

4.4 

4.3 


3.8 

3.7 

7 

5.6 

mm 

4.4 

4.1 

4.0 

3.9 

3.6 

3.4 

3.2 

8 

5.3 


4.1 

3.8 

3.7 

3.6 

3.3 

3.1 

2.9 

9 

5.1 


3.9 

3.6 

3.5 

3.4 

3.1 

2.9 

2.7 


5.0 


3.7 

3.5 

3,3 

3.2 

2.9 

2.7 

2.5 

11 

4.8 


3.6 

3.4 

3.2 

3.1 

2.8 

2.6 

2.4 

12 

4.8 

3.9 

3.5 

3.3 

3.1 

3.0 

2.7 

2.5 

2.3 

13 

4.7 

3.8 

3.4 

3.2 

3.0 

2.9 

2.6 

2.4 

2.2 

14 

4.6 

3.7 

3.3 

3.1 

3.0 

2.9 

2.5 

2.3 

2.1 

15 

4.5 

3.7 

3.3 

3.1 

2.9 

2.8 

2.5 

2.3 

2.1 

16 

4.5 

3,6 

3.2 

^^9 

2.9 

2.7 

2.4 

' 2.2 

2.0 

17 

4.5 

3.6 

3.2 


2.8 

2.7 

2.4 

2.2 

2.0 

18 

4.4 

3.6 

3.2 

2.9 

2.8 

2.7 

2.3 

2.1 

1.9 

19 

4.4 

3.5 

3.1 

2.9 

2.7 

2.6 

2.3 

2.1 

1.9 

20 

4.4 

3.5 

3,1 

2.9 

2.7 

2.6 

2.3 

2.1 

1.8 

22 

4.3 

3.4 

3.1 

2.8 

2.7 

2.6 

2.2 


1.8 

24 

4.3 

3.4 

3.0 

2.8 

2.6 

2.5 

2.2 

2.0 

1.7 

26 

4.2 

3.4 

3.0 

2.7 

2.6 

2.5 

2.2 

2.0 

mm 

28 

4.2 

3.3 

3.0 

2.7 

2.6 

2.4 

2.1 

1.9 

mSm 

30 

4.2 

3.3 


2.7 

2.5 

2.4 

2.1 

1,9 

1.6 

40 

4.1 

3.2 


2.6 

2.5 

2.3 


1.8 


60 

4.0 

3,2 

2.8 

2.5 

2.4 

2.3 

1.9 

1.7 

■El 

120 

3.9 

3.1 

2.7 

2.5 

2.3 

2.2 

1.8 

1.6 

mlwm 

00 

3.8 

3.0 

2.6 

2.4 

2.2 

2.1 

1.8 

1.5 

1 


Abridged from Table V of "Statistical Tables for Biological, Agricultural, and 
Medical Research.** (R. A. Fisher and F. Yates : Oliver and Boyd). 
























Tables of Variance Ratio (111) 


N, 

0.01 Significance Level 

1 

2 

3 

4 

5 

6 

8 

12 

24 

00 

1 


4999 

5403 

5625 

5764 

5859 

5981 

6106 

6234 

6366 

2 

98.5 


99.2 

99.3 

99.3 

99.4 

99.3 

99.4 

99.5 

99.5 

3 

34.1 


29.5 

28.7 

28.2 

27.9 

27.5 

27.1 

26.6 

26.1 

4 

21.2 

18.0 

16.7 

16.0 

15.5 

15.2 

14.8 

14.4 

13.9 

13.5 

5 

16.3 

13.3 

12.1 

11.4 

11.0 

10.7 

10.3 

9.9 

9.5 

9.0 

6 

13.7 

10.9 

9.8 

9.2 

8.8 

8.5 

8.1 

7.7 

7.3 

6.9 

7 

12.3 

9.6 

8.5 

7.9 

7.5 

7.2 

6.8 

6.5 

6.1 

5.7 

8 

11.3 

8.7 

7.6 

mjm 

6.6 

6.4 

6.0 

5.7 

5.3 

4.9 


10.6 

8.0 

7.0 


6.1 

5.8 

5.5 

5.1 

4.7 

4.3 

BH 

10.0 

7.6 

6.6 

6,0 

5.6 

5.4 

5.1 

4.7 

4.3 

3.9 



7.2 

6.2 


5.3 

5.1 

4.7 

4.4 

4.0 

3.6 



6.9 

6.0 


5.1 

4.8 

4.5 

4.2 

3.8 

3.4 


9.1 

6.7 

5.7 


4.9 

4.6 

4.3 

4.0 

3.6 

3.2 



6.5 

5.6 

5.0 

4.7 

4.5 

4.1 

3.8 

3.4 

3.0 

15 

8.7 

6.4 

5.4 

4.9 

4.6 

4.3 

4.0 

3.7 

3.3 

2.9 

16 

8.5 

6.2 

5.3 

4.8 

4.4 

4.2 

3.9 

3.6 

3.2 

2.8 

17 

8.4 

6.1 

5.2 

4.7 

4.3 

4.1 

3.8 

3.5 

3.1 

2.7 

18 

8.3 

6.0 

5.1 

4.6 

4.3 

4.0 

3.7 

3.4 

3.0 

2.6 

19 

8.2 

5.9 

5.0 

4.5 

4.2 

3.9 

3.6 

3.3 

2,9 

2.5 

20 

8.1 

5.9 

4.9 

4.4 

4.1 

3.9 

3.6 

3.2 

2.9 

2.4 

22 

7.9 

5.7 

4.8 

4.3 

4.0 

3.8 

3.5 

3.1 

2.8 

2.3 

24 

7.8 

5.6 

4.7 

4.2 

3.9 

3.7 

3.3 

3.0 

2.7 

2.2 

26 

7.7 

5.5 

4.6 

4.1 

3.8 

3.6 

3.3 

3.0 

2.6 

2.1 

28 

7.6 

5.5 

4.6 

4.1 

3.8 

3.5 

3.2 

2.9 

2.5 

2.1 

30 

7.6 

5.4 

4.5 

4.0 

3.7 

3.5 

3.2 

2.8 

2.5 

2.0 

40 

7.3 

5.2 

4.3 

3.8 

3.5 

3.3 

3.0 

2.7 

2.3 

1.8 

60 

7.1 

5.0 

4.1 

3.7 

3.3 

3.1 

2.8 

2.5 

2.1 

1.6 

120 

6.9 

4.8 

4.0 

3.5 

3.2 

3.0 

2.7 

2.3 

2.0 

1.4 

00 

6.6 

4.6 

3.8 

3.3 

3.0 

2.8 

2.5 

2.2 

1.8 

1.0 


Abridged from Table V of “Statistical Tables for Biological, Agricultural and 
Medical Research.” (R, A. Fisher and F. Yates : Oliver and Boyd). 
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Table of Variance Ratio (iv) 






0.001 Significance Level 




N« 

1 

2 

3 

n 

5 

6 

8 

12 

24 

00 

1 

2 

998 

varyinj 

999 

from 40 

999 

),000 to < 

999 


999 

999 

999 

999 

999 

3 

167 

148 

141 

137 

135 

133 

131 

128 

126 

123 

4 

74.1 

61.3 

56.2 

53.4 

51.7 

MiMi 

EEffil 

47.4 

45.8 

44.1 

5 

47.0 

36.6 

33.2 

31.1 

29.8 

28.8 

27.6 

26.4 

25.1 

23.8 

6 

35.5 

27.0 

23.7 

21.9 




18.0 

16.9 

15.8 

7 

29.2 

21.7 

18.8 

17.2 

16.2 

15.5 

14.6 

13.7 

12.7 

11.7 

8 

25.4 

18.5 

15.8 

14.4 

13.5 

12.9 

12.0 

11.2 

10.3 

9.3 

9 

22.9 

16.4 

13.9 

12.6 

11.7 

11.1 

lEsa 

9.6 

8.7 

7.8 

10 

21.0 

14.9 

12.6 

11.3 

10.5 

9.9 

9.2 

8.5 

7.6 

6,8 

11 

19.7 

13.8 

11.6 

10.4 

9.6 

9.1 

8.3 

7.6 

6.9 

6.0 

12 

18.6 

13.0 

10.8 

9.6 

8,9 

8.4 

7.7 

mtm 

6.3 

5.4 

13 

17.8 

12.3 

10.2 

9.1 

8.4 

7.9 

7.2 

6.5 

5.8 

5.0 

14 

17.1 

11.8 

9.7 

8.6 

7.9 

7.4 

6.8 

6,1 

5.4 


15 

16.6 

11.3 

9.3 

8.3 

7.6 

7.1 

6.5 

5.8 

5.1 

la 

16 

^6.1 

11.0 

9.0 

7.9 

7,3 

6.8 

6.2 

5.6 

4.9 

4.1 

17 

15.7 

10.7 

8.7 

7,7 

7.0 

6.6 

6.0 

5.3 

4.6 


18 

15.4 

10.4 

8.5 

7.5 

mmm 

6.4 

5.8 

5.1 

4.5 


19 

15.1 

10.2 

8.3 

7.3 


6.2 

5.6 

MW 

4.3 


20 

14.8 

10.0 

8.1 

7.1 


6.0 

5.4 

4.8 

4.2 


22 

14.4 

9,6 

7.8 

6.8 

^9 

5.8 

5.2 

4.6 

3.9 


24 

14.0 

9.3 

7.6 

6.6 

6.0 

5.6 

5.0 

4.4 

3.7 

3.0 

26 

13.7 

9,1 

7.4 

6.4 

5.8 

5.4 

4.8 

4.2 

3.6 

2.8 

28 

13.5 

8.9 

7.2 

6.3 

5.7 

5.2 

4.7 

4.1 

3.5 

2.7 


13.3 

8.8 

7.1 

6.1 

5.5 

5.1 

4.6 

4.0 

3.4 

2.6 

40 

12.6 

8.2 

6.6 

5.7 

5.1 

4.7 

4.2 

3.6 

3.0 

2.2 

60 

12.0 

7.8 

6.2 

5.3 

4.8 

4.4 

3.9 

3,3 

2.7 

1.9 


11.4 

7.3 

5.8 

5.0 

4.4 

4.0 

3.5 

3.0 

2.4 

1.6 

GO 


6.9 

5.4 

5.6 

4.1 

3.7 

3.3 

2.7 

2.1 

1.0 


Abridged from Table V of “Statistical Tables for Biological, A^icultural and 
Medical Research.” (R. A. Fisher and F. Yates : Oliver and Boyd). 
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TABLE IV 


Table of the Correlation Coefficient 


Degrees 

of 

Freedom 

r 

0.10 

0.05 

0.02 

0.01 

0.001 

1 

.988 

.997 

. .999 

1.000 

1.000 

2 

.900 

.950 

.980 

.990 

.999 

3 

.805 

.878 

.934 

.959 

.992 

4 

.729 

.811 

.882 

.917 

.974 

5 

.669 

.754 

.833 

.874 

.951 

6 

.621 

.707 

.789 

.834 

.925 

7 

.582 

.666 

.750 

.798 

.898 

8 

.549 

.632 

.716 

.765 

.872 

9 

.521 

.602 

.685 

.735 

.847 

10 

.497 

.576 

.658 

.708 

.823 

11 

.476 

.553 

.634 

.684 

.801 

12 

.457 

.532 

.612 

.661 

.780 

13 

.441 

.514 

.592 

.641 

.760 

14 

.426 

.497 

.574 

.623 

.742 

15 

.412 

.482 

.558 

.606 

.725 

16 

.400 

.468 

.543 

.590 

.708 

17 

.389 

.456 

.528 

.575 

.693 

18 

.378 

,444 

.516 

.561 

.679 

19 

.369 

.433 

.503 

.549 

.665 

20 

.360 

.423 

.492 

.537 

.652 

25 

.323 

.381 

.445 

.487 

.597 

30 

.296 

.349 

.409 

.449 

.554 

. 35 

.275 

.325 

.381 

.418 

.519 

40 

.257 

.304 

.358 

.393 

.490 

45 

.243 

.287 

.338 

.372 

.465 

50 

.231 

.273 

.322 

.354 

.443 

60 

.211 

.250 

.295 

.325 

.408 

70 

.195 

.232 

.274 

.302 

.380 

80 

.183 

.217 

.256 

.283 

.357 

90 

.173 

.205 

.242 

.267 

.337 

100 

.164 

.195 

.230 

.254 

.321 


Abridged from Table VI of “Statistical Tables for Biojogical, Agricultural and 
Medical Research.” (R. A. Fisher and F. Yates : Oliver and Boyd). 
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TABLE V 


Factors for Control Charts 


Number in 
Sample 

A' 

0.025 

A' 

0.001 

D" 

0.975 

D' 

0.999 

d 

n 

2 

1.229 

1.937 

2.81 

4.12 

1.13 

3 

0.668 

1.054 

2.17 

2.98 

1.69 

4 

0.476 

0.750 

1.93 

2.57 

2.06 

5 

0.377 

0.594 

1.81 

2.34 

2.33 

6 

0.316 

0.498 

1.72 

2.21 

2.53 

7 

0.274 

0.432 

1.66 

2.11 

2.70 

8 

0.244 

0.384 

1.62 

2.04 

2.85 

9 

0.220 

0.347 

1.58 

1.99 

2.97 

10 

0.202 

0.317 

1.56 

1.93 

3.08 


Abridged from B, S. 600R “Quality Control Charts”. (B. P. 

W. J. Jennett. British Standards Institution.) 


Dud ding and 


Printed in the U.S.A. by DORAY PRESS, NEW YORK, N. Y, 


151 














106 730 




r 


